|
ABSTRACT
The more the web grows, the harder it is for users to find the information they need. As a result, it is even more difficult to identify when documents are related. To find out that two or more documents are in fact related, users have to navigate by the documents in carry out an analysis about their content. This paper presents an infrastructure allowing the use of latent semantic analysis and open hypermedia concepts in the automatic identification of relationships among web pages. Latent Semantic Analysis has been proposed by the information retrieval community as an attempt to organize automatically text objects into a semantic structure appropriate for matching. In open hypermedia systems, links are managed and stored in a special database, a linkbase, which allows the addition of hypermedia functionality to a document without changing the original structure and format of the document. We first present two complementary link-related efforts: an extensible latent semantic indexing service and an open linkbase service. Leveraging off those efforts, we present an infrastructure that identifying latent semantic links within web repositories and makes them available in an open linkbase. To demonstrate by example the utility of our open infrastructure, we built an application presenting a directory of semantic links extracted from web sites.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
G. D. Abowd, M. G. C. Pimentel, B. Kerimbaev, Y. Ishiguro, and M. Guzdial. Anchoring discussion in lecture: an approach to collaboratively extending classroom digital media. In Proceedings of the Computer Support for Collaborative Learning (CSCL) Conference, pages 11--19, Stanford University, 1999
|
 |
3
|
|
 |
4
|
|
 |
5
|
|
| |
6
|
R. F. Bulcao Neto. WLS: An XML-based open hypermedia service for the Web. Msc thesis, Instituto de Ciencias Matematicas e de Computacao da USP, Sao Carlos, Sao Paulo, 2001. In Portuguese
|
| |
7
|
L. Carr, D. C. DeRoure, H. C. Davies, and W. Hall. The distribuited link service: A tool for publishers, authors and readers. In Proceedings of the Fourth International World Wide Web, pages 647--656. ACM Press, 1995
|
 |
8
|
Leslie Carr , Wendy Hall , Sean Bechhofer , Carole Goble, Conceptual linking: ontology-based open hypermedia, Proceedings of the 10th international conference on World Wide Web, p.334-342, May 01-05, 2001, Hong Kong, Hong Kong
[doi> 10.1145/371920.372084]
|
| |
9
|
H. Davis, A. Lewis, and A. Rizk. OHP: a draft proposal for a standard Open Hypermedia Protocol. In Proceedings of the Second Workshop on Open Hypermedia Systems (Hypertext'96), pages 27--53. ACM Press, 1996
|
| |
10
|
S. DeRose, E. Maler, and R. Daniel. XML Pointer Language (XPointer), Last Call Working Draft. on-line in World Wide Web, 2001. URL: http://www.w3.org/TR/xptr
|
 |
11
|
S. T. Dumais , G. W. Furnas , T. K. Landauer , S. Deerwester , R. Harshman, Using latent semantic analysis to improve access to textual information, Proceedings of the SIGCHI conference on Human factors in computing systems, p.281-285, May 15-19, 1988, Washington, D.C., United States
[doi> 10.1145/57167.57214]
|
 |
12
|
Samhaa R. El-Beltagy , Wendy Hall , David De Roure , Leslie Carr, Linking in context, Proceedings of the twelfth ACM conference on Hypertext and Hypermedia, August 14-18, 2001, Århus, none, Denmark
[doi> 10.1145/504216.504257]
|
| |
13
|
Andrew M. Fountain , Wendy Hall , Ian Heath , Hugh C. Davis, MICROCOSM: an open model for hypermedia with dynamic linking, Hypertext: concepts, systems and applications, Cambridge University Press, New York, NY, 1992
|
 |
14
|
G. W. Furnas , S. Deerwester , S. T. Dumais , T. K. Landauer , R. A. Harshman , L. A. Streeter , K. E. Lochbaum, Information retrieval using a singular value decomposition model of latent semantic structure, Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval, p.465-480, May 1988, Grenoble, France
[doi> 10.1145/62437.62487]
|
 |
15
|
|
| |
16
|
|
| |
17
|
|
 |
18
|
|
| |
19
|
|
 |
20
|
|
 |
21
|
|
 |
22
|
|
 |
23
|
|
| |
24
|
A. M. M. Miotto and R. P. M. Fortes. Uma visao geral das caracteristicas de sistemas hipermidia abertos. Technical Report 124, Instituto de Ciencias Matematicas e de Computaccao (ICMC-USP), Novembro 2000. 22 p. In Portuguese
|
| |
25
|
mnoGoSearch Group. mnoGoSearchtm web search engine software. Internet, 2001. URL: http:/www.mnogosearch.ru
|
 |
26
|
Christos H. Papadimitriou , Hisao Tamaki , Prabhakar Raghavan , Santosh Vempala, Latent semantic indexing: a probabilistic analysis, Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.159-168, June 01-04, 1998, Seattle, Washington, United States
[doi> 10.1145/275487.275505]
|
 |
27
|
Maria da Graça Pimentel , Gregory D. Abowd , Yoshihide Ishiguro, Linking by interacting: a paradigm for authoring hypertext, Proceedings of the eleventh ACM on Hypertext and hypermedia, p.39-48, May 30-June 03, 2000, San Antonio, Texas, United States
[doi> 10.1145/336296.336315]
|
| |
28
|
M. G. C. Pimentel, Y. I. B. Kerimbaev, G. D. Abowd, and M. Guzdial. Supporting long-term educational activities through dynamic Web interfaces. Interacting With Computers Journal, 13:353--374, 2001
|
| |
29
|
M. G. C. Pimentel, A. A. Macedo, and G. D. Abowd. Linking homogeneous Web-based repositories. In Proceedings of International Workshop on Information Integration on the Web, pages 35--42, Rio de Janeiro-Brazil, 2001. URL: http:/www.cos.ufrj.br/wiiw/schedule.html
|
 |
30
|
Morgan N. Price , Gene Golovchinsky , Bill N. Schilit, Linking by inking: trailblazing in a paper-like hypertext, Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems, p.30-39, June 20-24, 1998, Pittsburgh, Pennsylvania, United States
[doi> 10.1145/276627.276631]
|
| |
31
|
G. Salton. A blueprint for automatic indexing. ACM Conference on Research and Development in Information Retrieval Forum, 16(2):22--38, 1981
|
 |
32
|
|
 |
33
|
|
 |
34
|
Ilmério Silva , Berthier Ribeiro-Neto , Pável Calado , Edleno Moura , Nívio Ziviani, Link-based and content-based evidential information in a belief network model, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, p.96-103, July 24-28, 2000, Athens, Greece
[doi> 10.1145/345508.345554]
|
 |
35
|
|
| |
36
|
L. C. Tai. Architecture support for content-based hypermedia. In Proceedings of the Second Workshop on Open Hypermedia Systems (Hypertext'96), pages 1--5, Wahsington D.C., USA, Maio 1996. ACM Press
|
 |
37
|
|
 |
38
|
|
 |
39
|
|
CITED BY 10
|
|
Renato Bulcao Neto , Claudia Akemi Izeki , Maria da Graça Pimentel , Renata Pontin Fortes , Khai Nhut Truong, An open linking service supporting the authoring of web documents, Proceedings of the 2002 ACM symposium on Document engineering, November 08-09, 2002, McLean, Virginia, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Alessandra A. Macedo , Laércio Baldochi, Jr. , José A. Camacho-Guerrero , Renan G. Cattelan , Maria Da Pimentel, Automatically linking live experiences captured with a ubiquitous infrastructure, Multimedia Tools and Applications, v.37 n.2, p.93-115, April 2008
|
|
|
|
|
|
|
|
|
José A. Camacho-Guerrero , Alex A. Carvalho , Maria G. C. Pimentel , Ethan V. Munson , Alessandra A. Macedo, Clustering as an approach to support the automatic definition of semantic hyperlinks, Proceedings of the eighteenth conference on Hypertext and hypermedia, September 10-12, 2007, Manchester, UK
|
|