Netzwerke | Networks - Part 12
posted by PP on 2006/04/04 13:17
[ Netzwerke | Networks ]
What is the invisible web? And: Peer Review in the Google Age - two interesting presentations on E-LIS I could recommend:- Natalia Arroyo (2004): What is the invisible web? A crawler perspective. [.pdf]
- Margaret Dominy, Jean-Claude Bradley and Jay Bhatt (2006): Peer Review in the Google Age: Is technology changing the way science is done and evaluated? [.ppt]
The invisible Web, also known as the deep Web or dark matter, is an important problem for Webometrics due to difficulties of conceptualization and measurement. The invisible Web has been defined to be the part of the Web that cannot be indexed by search engines, including databases and dynamically generated pages. Some authors have recognized that this is a quite subjective concept that depends on the point of view of the observer: what is visible for one observer may be invisible for others. In the generally accepted definition of the invisible Web, only the point of view search engines has been taken into account. Search engines are considered to be the eyes of the Web, both for measuring and searching.
In addition to commercial search engines, other tools have also been used for quantitative studies of the Web, such as commercial and academic crawlers. Commercial crawlers are programs developed by software companies for other purposes than Webometrics, such as Web sites management, but might also be used for crawling Web sites and reporting on their characteristics (size, hypertext structure, embedded resources, etc). Academic crawlers are programs developed by academic institutions for measuring Web sites for Webometric purposes.
In this paper, Sherman and Price’s "truly invisible Web" is studied from the point of view of crawlers. The truly invisible Web consists of pages that cannot be indexed for technical reasons. Crawler parameters are significantly different to search engines, due to different design purposes resulting in different technical specifications. In addition, huge differences among crawlers on their coverage of the Web have been demonstrated in previous investigations. Both aspects are clarified though an experiment in which different Web sites, including diverse file formats and built with different types of Web programming, are analyzed, on a set date, with seven commercial crawlers (Astra SiteManager, COAST WebMaster, Microsoft Site Analyst, Microsoft Content Analyzer, WebKing, Web Trends and Xenu), and an academic crawler (SocSciBot). Each Web site had been previously copied to a hard disk, using a file-retrieving tool, in order to compare them with the data obtained by crawlers. The results are reported and analyzed in detail to produce a definition and classification of the invisible Web for commercial and academic crawlers.
Abstract for Margaret Dominy, Jean-Claude Bradley and Jay Bhatt (2006): Peer Review in the Google Age: Is technology changing the way science is done and evaluated?:
Peer review in science and technology has been the subject of significant controversy. In a world where finding information is laborious, the knowledge that a document is from a "“trusted source" is valuable. This presentation explores whether coming of Google and other web based search engines have impacted how science is done, peer reviewed and published and whether the present peer review process is necessary and maintains the quality and focus of a journal and articles published. With increasing number of Open Access Institutional Repositories worldwide and indexed by Google, more and more scientific literature is now visible than before and hence resulting in increased worldwide access to scientific literature. Quality blogs and wikis have speeded up the process of scientific communication through informal peer reviews while the present system of formal peer review still attempts to maintain quality of a reserach paper although it tends to slow down the process of scientific communication. More debate and discussion on issues mentioned will further shed light on this important subject of peer reviews in Google environment.
< previous Posting next >
<< previous Topic next >>
Senior Editor
Seitenwechsel. Geschichten vom Fußball. Hgg. v. Samo Kobenter u. Peter Plener. Wien: Bohmann 2008, 237 pp.
(Weitere Informationen hier)
(Weitere Informationen hier)
Transcarpathica. Germanistisches Jahrbuch Rumänien 3-4/2004-2005. Hgg. v. Andrei Corbea-Hoisie u. Alexander Rubel. Bukarest/Bucuresti: Editura Paideia 2008, 336 pp.
[Die online-Fassung meines Einleitungsbeitrags "Thesen zur Bedeutung der Medien für Erinnerungen und Kulturen in Mitteleuropa" findet sich auf Kakanien revisited (Abstract / .pdf).]
[Die online-Fassung meines Einleitungsbeitrags "Thesen zur Bedeutung der Medien für Erinnerungen und Kulturen in Mitteleuropa" findet sich auf Kakanien revisited (Abstract / .pdf).]
Antworten