Invisible Web, or Cloaked Web or Deep Web: The hidden treasure

Many have the naive expectation that they can locate anything on the World Wide Web using search engines όπως την Google ή το Yahoo ή την Ask.com ή το Bing. Η αλήθεια είναι ότι όλες αυτές οι μηχανές αναζήτησης έχουν ευρετηριάσεις από το 10% του συνόλου του ιστού. Το υπόλοιπο 90% ονομάζεται "Invisible Web", ή "Cloaked Web" the "Deep Web."

This means that there are huge amounts of data that are availablei to the public, but they remain hidden from the search engines that everyone knows.

Ίσως σας είναι δύσκολο να κατανοήσετε πως δισεκατομμύρια ιστοσελίδες δεν μπορούν να εμφανιστούν στα αποτελέσματα της Google. Όμως υπάρχουν. Τα ρομπότ 'spiders' που σαρώνουν και αρχειοθετούν το world wide web έχουν περιορισμένες δυνατότητες.

To better understand, let's start with some numbers about the size of services offered by: Google.com, Yahoo.com, Cyberatlas, and MIT. The statistics data it's from summer 2013:

Google.com has 40 billions of public webpages in its archives. 100 + Two of these are static websites and publicly available. These pages can easily be found by Google and other search engines.
11 + two-static pages are hidden by the public after they have declared that they contain private content or are on the intranet. These are corporate pages that are open only to employees of these companies.
450 + billions of pages have databases that are completely invisible to Google. For example, government databases with tax information,

Google is considered to have the best search database today. The company's spiders list millions of web pages every week.

So if Google has saved only one 8-10% of the World Wide Web and the other search engines have even smaller databases, then where does the remaining 92% of the content on the Internet hide?

Το "Invisible Web" (ή "Deep Web" ή "Cloaked Web") είναι το περιεχόμενο που δεν εμφανίζεται στις μηχανές αναζήτησης.
More specifically: The Invisible Web consists of 220+ billion web pages that are not stored as static web pages. The Invisible Web consists of on-demand pages and databases. That is, pages that exist only as reports of changing data. As of August 2007, robotic spiders had not advanced enough to read these private databases. Only people can access it, and only if they have the knowledge.

Technical terminology:

"Spider": An artificial intelligence program, or robot, that has been sent to read millions of static web pages on the public Internet. The information collected by Spiders is stored in databases used by search engines.

"Database-Driven Web Content": websites that exist only temporarily, and are only created when readers request answers from a large database. These temporary webpages are dynamic, and usually cannot be bookmarked. They usually have extremely large URLs.

The Invisible Web contains Dynamic Web Pages. This means that a database creates a temporary page to answer your question! Good one huh;

Contents hide

How can I use the Invisible Web?

Humanities

Special foundations of the US government

Health and Science

Mega-Portal

How can I use the Invisible Web?

There are too many who ask exactly the same. Let's look at some valuable databases below.

Humanities

Voice of the Shuttle: Started 1994, and is one of the oldest and largest human data bases on the Web.

Special foundations of the US government

University of Michigan Government Documents Center: You will find a great deal of data, surveys, statistics, and many of the high levels of the US government. The databases offered include Arts, Health Sciences, Social Sciences and International Studies.
USA.gov: A portal station for many agencies of the United States government. It includes government positions, services, and information on finding grants, loans and financial aid.

Health and Science

PsycNET: Use the American Psychological Association database to find excerpts and entire journals on various psychology topics.
Scirus: A search tool that works exclusively for scientific information. The amazing search tool has hundreds of millions of scientific, and academic papers to help researchers from around the world.
Healthfinder: It contains information from over a thousand different health databases on the internet.
RXList: If you are looking for reliable information about medicines, then this database is for you.

Mega-Portal

The University of California, Riverside maintains it InfoMine, an incredible source of knowledge that at the last count contained over 100.000 links and access to hundreds, if not thousands, of databases.

Γενικά υπάρχουν πάρα πολλές, ιστοσελίδες που έχουν συσταθεί για να φέρνουν στην επιφάνεια δεδομένα από το Invisible Web. Η CompletePlanet.com είναι μία από αυτές. Περιέχει "πάνω από 70.000 βάσεις δεδομένων."

Most of the information about the invisible Web is kept by academic institutions. There are "academic gates" that can help you find this information. To find almost every educational resource on the web, simply type the following term into your favorite search engine:

site:.edu "θέμα που αναζητώ"

Your search will return results for edu sites only. If you want to search for something from a specific university use the university URL in your search:

site:www.πενεπιστήμιο.gr "θέμα που αναζητώ"
This is just the tip of the iceberg. All that we have mentioned in this article is just beginning to touch the huge resources available on the Invisible Web. As time passes, the Invisible Web becomes larger.