| AJAXSearch: crawling, indexing and searching web 2.0 applications |
| Full text |
Pdf
(468 KB)
|
Source
|
Proceedings of the VLDB Endowment
archive
Volume 1 , Issue 2 (August 2008)
table of contents
SESSION: Demonstrations: web, textual data
table of contents
Pages 1440-1443
Year of Publication: 2008
ISSN:2150-8097
|
|
Authors
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 14, Downloads (12 Months): 156, Citation Count: 1
|
|
|
ABSTRACT
Current search engines such as Google and Yahoo! are prevalent for searching the Web. Search in dynamic pages, however, is either inexistent or far from perfect. AJAX and Rich Internet Application are such applications. They are increasingly frequent on the Web (in YouTube, Amazon, GMail, Yahoo!Mail) or mobile devices and are offering a high degree of interactivity to the user, by seamlessly loading content from the server without the need to refresh the page. Current search engines cannot correctly index AJAX applications. This produces false positives and false negatives, because search engines do not understand the application logic that loads content dynamically. Crawling an AJAX application is a difficult problem. Since the user invokes events on the page, crawling must identify the different application states generated by the client-side logic. This demo sets the stage for this new type of search and shows that a search engine for AJAX can be built. Among others, the challenges, as opposed to traditional search engines, are: automatically identifying states by triggering events, efficiently crawling application states, avoiding the invocation of potentially very numerous events, scalability in the number of events, duplicate elimination of states, result presentation and aggregation, ranking. The demo presents the AJAX search engine: crawler, indexer and query processor, applied on a real application and showcases challenges and solutions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
AJAX News with calendar. http://www.giannifrey.com/ajax/news.cfm?showCalendar=true.
|
| |
3
|
Sample AJAX News Application. http://www.giannifrey.com/ajax/news.html.
|
| |
4
|
S. Amer-Yahia, C. Botev, S. Buxon, P. Case, J. Doerre, D. McBeath, M. Rys, and J. Shanmugasundaram. XQuery 1.0 and XPath 2.0 Full-Text, W3C Working Draft, 4 April 2005. http://www.w3.org/TR/2005/WD-xquery-full-text-20050404/.
|
 |
5
|
|
| |
6
|
|
| |
7
|
G. Bhalotia, C. Nakhe, A. Hulgeri, S. Chakrabarti, and S. Sudarshan. Keyword searching and browsing in databases using BANKS. In ICDE, 2002.
|
| |
8
|
S. Boag, D. Chamberlin, M. F. Fernndez, D. Florescu, J. Robie, and J. Simon. XQuery 1.0: An XML Query Language W3C Candidate Recommendation, 3 November 2005. http://www.w3.org/TR/2005/CR-xquery-200511033.
|
| |
9
|
COBRA Toolkit. http://html.xamjwg.org/cobra.jsp.
|
 |
10
|
|
| |
11
|
|
| |
12
|
Yahoo! Mail. http://mail.yahoo.com.
|
|