ACM Home Page
Please provide us with feedback. Feedback
Implementation and evaluation of a quality-based search engine
Full text PdfPdf (354 KB)
Source Conference on Hypertext and Hypermedia archive
Proceedings of the seventeenth conference on Hypertext and hypermedia table of contents
Odense, Denmark
SESSION: Education and evaluation table of contents
Pages: 73 - 84  
Year of Publication: 2006
ISBN:1-59593-417-0
Author
Thomas Mandl  University of Hildesheim, Hildesheim, Germany
Sponsors
ACM: Association for Computing Machinery
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 148,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Request Permissions Request Permissions    Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1149941.1149957
What is a DOI?

ABSTRACT

In this paper, an approach for the implementation of a quality-based Web search engine is proposed. Quality retrieval is introduced and an overview on previous efforts to implement such a service is given. Machine learning approaches are identified as the most promising methods to determine the quality of Web pages. Features for the most appropriate characterization of Web pages are determined. A quality model is developed based on human judgments. This model is integrated into a meta search engine which assesses the quality of all results at run time. The evaluation results show that quality based ranking does lead to better results concerning the perceived quality of Web pages presented in the result set. The quality models are exploited to identify potentially important features and characteristics for the quality of Web pages.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
Barabási, A.-L. Linked: The New Science of Networks. Perseus, 2002
 
4
Beck, S. Evaluation Criteria: The Good, The Bad & The Ugly: or, Why It's a Good Idea to Evaluate Web Sources. (1997) http://lib.nmsu.edu/instruction/evalcrit.html
5
6
7
 
8
 
9
 
10
11
12
13
 
14
 
15
De la Cruz, T., Mandl, T. and Womser-Hacker, C. Cultural Dependency of Quality Perception and Web Page Evaluation Guidelines: Results from a Survey. In Designing for Global Markets 7: Proc. Seventh International Workshop on Internationalization of Products and Systems (IWIPS 2005) Amsterdam, The Netherlands. 15--27
16
 
17
18
19
 
20
Foltz, P.W., Klintsch, W. and Landauer, T.K. The Measurement of Textual Coherence with Latent Semantic Analysis. In Discourse Processes, vol. 25 (2&3) (1998) 285--307.
 
21
Henzinger, M. Link Analysis in Web Information Retrieval. In Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, vol. 23 (3) (2000) 3--8.
22
 
23
24
 
25
Jensen, N., Hackl, R., Mandl, T. and Strötgen, R. Web Retrieval Experiments with the EuroGOV Corpus at the University of Hildesheim. In Accessing Multilingual Information Repositories: 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, Revised Selected Papers. Springer {LNCS 4022} (2006)
 
26
Klas, C.-P. Fuhr, N. A new effective approach for categorizing web documents. In Proceedings of the 22nd BCS-IRSG Colloquium on IR Research (2000)
 
27
Langville, A. and Meyer, C. Deeper Inside PageRank. In Internet Mathematics 1 (3) (2003) 335--380.
 
28
Mandl, T. Web Link Behavior and Consequences for Connectivity Based Authority Measures. In The Twelfth International World Wide Web Conference. 20-24 May 2003, Budapest. http://www2003.org/cdrom/papers/poster/p204/p204-mandl.html
 
29
Mandl, T. Automatische Bewertung der Qualität von Web-Seiten im Information Retrieval. Konstanz: Universitätsverlag (2006) to appear.
 
30
31
 
32
Pennock, D., Flake, G., Lawrence, S., Glover, E. and Giles, L. Winners don't take all: Characterizing the competition for links on the web. In Proc. National Academy of Sciences 99 (8) 2002. 5207--5211
 
33
Thelwall, M. The top 100 linked-to pages on UK university web sites: high in-link counts are not usually associated with quality scholarly content. In Journal of Information Science 28 (6) 2002. 483--491.
34
 
35
36
37
 
38
Yang, K. Combining text- and link-based retrieval methods for Web IR. In Proc. the Ninth Text REtrieval Conf (TREC 9) (2000)
39