|
ABSTRACT
In this paper, an approach for the implementation of a quality-based Web search engine is proposed. Quality retrieval is introduced and an overview on previous efforts to implement such a service is given. Machine learning approaches are identified as the most promising methods to determine the quality of Web pages. Features for the most appropriate characterization of Web pages are determined. A quality model is developed based on human judgments. This model is integrated into a meta search engine which assesses the quality of all results at run time. The evaluation results show that quality based ranking does lead to better results concerning the perceived quality of Web pages presented in the result set. The quality models are exploited to identify potentially important features and characteristics for the quality of Web pages.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
 |
2
|
|
| |
3
|
Barabási, A.-L. Linked: The New Science of Networks. Perseus, 2002
|
| |
4
|
Beck, S. Evaluation Criteria: The Good, The Bad & The Ugly: or, Why It's a Good Idea to Evaluate Web Sources. (1997) http://lib.nmsu.edu/instruction/evalcrit.html
|
 |
5
|
Allan Borodin , Gareth O. Roberts , Jeffrey S. Rosenthal , Panayiotis Tsaparas, Link analysis ranking: algorithms, theory, and experiments, ACM Transactions on Internet Technology (TOIT), v.5 n.1, p.231-297, February 2005
[doi> 10.1145/1052934.1052942]
|
 |
6
|
|
 |
7
|
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
 |
11
|
Soumen Chakrabarti , Mukul M. Joshi , Kunal Punera , David M. Pennock, The structure of broad topics on the web, Proceedings of the 11th international conference on World Wide Web, May 07-11, 2002, Honolulu, Hawaii, USA
[doi> 10.1145/511446.511480]
|
 |
12
|
|
 |
13
|
Ed H. Chi , Adam Rosien , Gesara Supattanasiri , Amanda Williams , Christiaan Royer , Celia Chow , Erica Robles , Brinda Dalal , Julie Chen , Steve Cousins, The bloodhound project: automating discovery of web usability issues using the InfoScentπ simulator, Proceedings of the SIGCHI conference on Human factors in computing systems, April 05-10, 2003, Ft. Lauderdale, Florida, USA
[doi> 10.1145/642611.642699]
|
| |
14
|
|
| |
15
|
De la Cruz, T., Mandl, T. and Womser-Hacker, C. Cultural Dependency of Quality Perception and Web Page Evaluation Guidelines: Results from a Survey. In Designing for Global Markets 7: Proc. Seventh International Workshop on Internationalization of Products and Systems (IWIPS 2005) Amsterdam, The Netherlands. 15--27
|
 |
16
|
|
| |
17
|
Stephen Dill , Ravi Kumar , Kevin S. McCurley , Sridhar Rajagopalan , D. Sivakumar , Andrew Tomkins, Self-similarity in the Web, Proceedings of the 27th International Conference on Very Large Data Bases, p.69-78, September 11-14, 2001
|
 |
18
|
|
 |
19
|
B. J. Fogg , Jonathan Marshall , Othman Laraki , Alex Osipovich , Chris Varma , Nicholas Fang , Jyoti Paul , Akshay Rangnekar , John Shon , Preeti Swani , Marissa Treinen, What makes Web sites credible?: a report on a large quantitative study, Proceedings of the SIGCHI conference on Human factors in computing systems, p.61-68, March 2001, Seattle, Washington, United States
[doi> 10.1145/365024.365037]
|
| |
20
|
Foltz, P.W., Klintsch, W. and Landauer, T.K. The Measurement of Textual Coherence with Latent Semantic Analysis. In Discourse Processes, vol. 25 (2&3) (1998) 285--307.
|
| |
21
|
Henzinger, M. Link Analysis in Web Information Retrieval. In Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, vol. 23 (3) (2000) 3--8.
|
 |
22
|
|
| |
23
|
|
 |
24
|
|
| |
25
|
Jensen, N., Hackl, R., Mandl, T. and Strötgen, R. Web Retrieval Experiments with the EuroGOV Corpus at the University of Hildesheim. In Accessing Multilingual Information Repositories: 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, Revised Selected Papers. Springer {LNCS 4022} (2006)
|
| |
26
|
Klas, C.-P. Fuhr, N. A new effective approach for categorizing web documents. In Proceedings of the 22nd BCS-IRSG Colloquium on IR Research (2000)
|
| |
27
|
Langville, A. and Meyer, C. Deeper Inside PageRank. In Internet Mathematics 1 (3) (2003) 335--380.
|
| |
28
|
Mandl, T. Web Link Behavior and Consequences for Connectivity Based Authority Measures. In The Twelfth International World Wide Web Conference. 20-24 May 2003, Budapest. http://www2003.org/cdrom/papers/poster/p204/p204-mandl.html
|
| |
29
|
Mandl, T. Automatische Bewertung der Qualität von Web-Seiten im Information Retrieval. Konstanz: Universitätsverlag (2006) to appear.
|
| |
30
|
|
 |
31
|
|
| |
32
|
Pennock, D., Flake, G., Lawrence, S., Glover, E. and Giles, L. Winners don't take all: Characterizing the competition for links on the web. In Proc. National Academy of Sciences 99 (8) 2002. 5207--5211
|
| |
33
|
Thelwall, M. The top 100 linked-to pages on UK university web sites: high in-link counts are not usually associated with quality scholarly content. In Journal of Information Science 28 (6) 2002. 483--491.
|
 |
34
|
|
| |
35
|
|
 |
36
|
|
 |
37
|
Gui-Rong Xue , Hua-Jun Zeng , Zheng Chen , Wei-Ying Ma , Hong-Jiang Zhang , Chao-Jun Lu, Implicit link analysis for small web search, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, July 28-August 01, 2003, Toronto, Canada
[doi> 10.1145/860435.860448]
|
| |
38
|
Yang, K. Combining text- and link-based retrieval methods for Web IR. In Proc. the Ninth Text REtrieval Conf (TREC 9) (2000)
|
 |
39
|
|
CITED BY 3
|
|
Meiqun Hu , Ee-Peng Lim , Aixin Sun , Hady Wirawan Lauw , Ba-Quy Vuong, On improving wikipedia search using article quality, Proceedings of the 9th annual ACM international workshop on Web information and data management, November 09-09, 2007, Lisbon, Portugal
|
|
|
|
|
|
Yusuke Yanbe , Adam Jatowt , Satoshi Nakamura , Katsumi Tanaka, Can social bookmarking enhance search in the web?, Proceedings of the 2007 conference on Digital libraries, June 18-23, 2007, Vancouver, BC, Canada
|
|