ACM Home Page
Please provide us with feedback. Feedback
Automated construction of web accessibility models from transaction click-streams
Full text PdfPdf (2.49 MB)
Source
International World Wide Web Conference archive
Proceedings of the 18th international conference on World wide web table of contents
Madrid, Spain
SESSION: Web engineering/session: end user web engineering table of contents
Pages 871-880  
Year of Publication: 2009
ISBN:978-1-60558-487-4
Authors
Jalal Mahmud  IBM, San Jose, CA, USA
Yevgen Borodin  Stony Brook University, Stony Brook, NY, USA
I. V. Ramakrishnan  Stony Brook University, Stony Brook, NY, USA
C. R. Ramakrishnan  Stony Brook University, Stony Brook, NY, USA
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 32,   Downloads (12 Months): 130,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1526709.1526826
What is a DOI?

ABSTRACT

Screen readers, the dominant assistive technology used by visually impaired people to access the Web, function by speaking out the content of the screen serially. Using screen readers for conducting online transactions can cause considerable information overload, because transactions, such as shopping and paying bills, typically involve a number of steps spanning several web pages. One can combat this overload by using a transaction model for web accessibility that presents only fragments of web pages that are needed for doing transactions. We can realize such a model by coupling a process automaton, encoding states of a transaction, with concept classifiers that identify page fragments "relevant" to a particular state of the transaction. In this paper we present a fully automated process that synergistically combines several techniques for transforming unlabeled click-stream data generated by transactions into a transactionmodel. These techniques include web content analysis to partition a web page into segments consisting of semantically related content, contextual analysis of data surrounding clickable objects in a page, and machine learning methods, such as clustering of page segments based on contextual analysis, statistical classification, and automata learning. The use of unlabeled click streams in building transaction models has important benefits: (i) visually impaired users do not have to depend on sighted users for creating manually labeled training data to construct the models; (ii) it is possible to mine personalized models from unlabeled transaction click-streams associated with sites that visually impaired users visit regularly; (iii) since unlabeled data is relatively easy to obtain, it is feasible to scale up the construction of domain-specific transaction models (e.g., separate models for shopping, airline reservations, bill payments, etc.); (iv) adjusting the performance of deployed models over timtime with new training data is also doable. We provide preliminary experimental evidence of the practical effectiveness of both domain-specific, as well as personalized accessibility transaction models built using our approach. Finally, this approach is applicable for building transaction models for mobile devices with limited-size displays, as well as for creating wrappers for information extraction from web sites.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
JAWS Screen Reader. http://www.freedomscientific.com.
 
2
 
3
J. F. Allen, N. Chambers, G. Ferguson, L. Galescu, H. Jung,M. D. Swift, and W. Taysom. Plow: A collaborative task learning agent. In Proc. of AAAI, 2007.
4
5
 
6
A. Banerjee and J. Ghosh. Click-stream clustering using weighted longest common subsequences. In Proc. of the Web Mining Workshop at the 1st SIAM Conference on Data Mining, pages 33--40, 2001.
7
8
 
9
 
10
 
11
D. Geoffray. The internet through the eyes of windows-eyes. In Proc. of Tech. and Persons with Disabilities Conf., 1999.
 
12
G. Greco, A. Guzzo, L. Pontieri, and D. Sacca. Mining expressive process models by clustering workflow traces. 2004.
 
13
14
 
15
P. Jaccard. Bulletin del la soci?t? vaudoisedes sciences naturelles 37. pages 241--272, 1901.
 
16
 
17
 
18
K. Lerman, A. Plangprasopchok, and C. A. Knoblock. Automatically labeling the inputs and outputs of web services. In Proc. of AAAI, 2006.
19
20
21
 
22
 
23
K. Murphy. Passively learning finite automata, 1996.
 
24
O. Nasraoui, C. Cardona, and C. Rojas. Mining of evolving web click-streams with explicit retrieval similarity measures. In Proc. of "International Web Dynamics Workshop", International World Wide Web Conference, 2004.
25
 
26
 
27
R. Silva, J. Zhang, and J. Shanahan. Probabilistic workflow mining. 2005.
28
 
29
30
31
 
32
 
33
V. Vapnik. Principles of risk minimization for learning theory. In D. S. Lippman, J. E. Moody, and D. S. Touretzky, editors, Advances in Neural Information Processing Systems 3, 1992.
34
 
35
M. Zajicek, C. Powell, and C. Reeves. Web search and orientation with brookestalk. In Proc. of Tech. and Persons with Disabilities Conf., 1999.

Collaborative Colleagues:
Jalal Mahmud: colleagues
Yevgen Borodin: colleagues
I. V. Ramakrishnan: colleagues
C. R. Ramakrishnan: colleagues