Math information retrieval: user requirements and prototype implementation
Source
International Conference on Digital Libraries
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Pittsburgh, PA, USA
SESSION: Expanding search
Pages 187-196  
Year of Publication: 2008
ISBN: 978-1-59593-998-2
Authors
Jin Zhao  National University of Singapore, Singapore, Singapore
Min-Yen Kan  National University of Singapore, Singapore, Singapore
Yin Leng Theng  Nanyang Technological University, Singapore, Singapore
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 157,   Citation Count: 0
DOI: http://doi.acm.org/10.1145/1378889.1378921

ABSTRACT

We report on the user requirements study and preliminary implementation phases of building a digital library that indexes and retrieves educational materials on math. We first review current approaches and resources for math retrieval, then report on interviews with a small group of potential users, conducted to ascertain their needs. While preliminary, the results suggest that meta-search and resource categorization are two basic requirements for a math search engine. In addition, we implement a prototype categorization system and show that generic features work well for identifying math content on a webpage but perform less well at categorizing it. We discuss our long-term goals, in which we plan to investigate how math expressions and text search may best be integrated.

