|
ABSTRACT
We report on the user requirements study and preliminary implementation phases in creating a digital library that indexes and retrieves educational materials on math. We first review the current approaches and resources for math retrieval, then report on the interviews of a small group of potential users to properly ascertain their needs. While preliminary, the results suggest that meta-search and resource categorization are two basic requirements for a math search engine. In addition, we implement a prototype categorization system and show that the generic features work well in identifying the math contents from the webpage but perform less well at categorizing them. We discuss our long term goals, where we plan to investigate how math expressions and text search may be best integrated.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
G. Attardi, A. Gullí, and F. Sebastiani. Automatic Web page categorization by link and context analysis. In C. Hutchison and G. Lanzarone, editors, Proceedings of THAI-99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, pages 105--119, Varese, IT, 1999.
|
| |
2
|
|
 |
3
|
|
 |
4
|
|
| |
5
|
|
| |
6
|
G. Buchanan, S. J. Cunningham, A. Blandford, J. Rimmer, and C. Warwick. Information seeking by humanities scholars. In ECDL, pages 218--229, 2005.
|
| |
7
|
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Extracting content structure for web pages based on visual representation. In Fifth Asia Pacific Web Conference (APWeb2003), 2003.
|
| |
8
|
D. O. Case. Looking for Information, Second Edition: A Survey of Research on Information Seeking, Needs, and Behavior (Library and Information Science). Academic Press, 2006.
|
| |
9
|
M. B. Eisenberg and R. E. Berkowitz. Information problem-solving: the Big Six Skills approach to library and information skills instruction. Norwood, NJ: Albex Publishing, 1990.
|
| |
10
|
|
| |
11
|
M. Hearst. Design recommendations for hierarchical faceted search interfaces. In ACM SIGIR Workshop on Faceted Search, 2006.
|
| |
12
|
P. Jipsen. Text-based input formats for mathematical formulas. In The Evolution of Mathematical Communication in the Age of Digital Libraries, IMA "Hot Topics" Workshop, U.S.A, 2006.
|
| |
13
|
F. Kamareddine, R. Lamar, M. Maarek, and J. B. Wells. Restoring natural language as a computerised mathematics input method. In Towards Mechanized Mathematical Assistants, MKM 2007, pages 280--295, 2007.
|
| |
14
|
M. Kan, J. Klavans, and K. McKeown. Linear segmentation and segment significance. 1998.
|
| |
15
|
|
| |
16
|
M. Kohlhase and I. Sucan. A search engine for mathematical formulae. In Proceedings of Artificial Intelligence and Symbolic Computation, AISC 2006, number 4120 in LNAI, pages 241--253. Springer Verlag, 2006.
|
| |
17
|
H. Kruger. Searching mathematics with zentralblatt math: Overview and outlook. In Enhancing the Searching of Mathematics, IMA "Hot Topics" Workshop, U.S.A, 2004.
|
| |
18
|
A. M. Lau. Advancing PARCELS: PARser for content extraction and logical structure using inter- and intra-similarity features. Technical report, National University of Singapore, 2005.
|
 |
19
|
|
 |
20
|
|
| |
21
|
P. Libbrecht and E. Melis. Methods to access and retrieve mathematical content in activemath. In ICMS, volume 4151 of Lecture Notes in Computer Science, pages 331--342. Springer, 2006.
|
| |
22
|
R. Miner and R. Munavalli. An approach to mathematical search through query formulation and data normalization. In Towards Mechanized Mathematical Assistants, MKM 2007, pages 342--355, 2007.
|
| |
23
|
G. Newby. Information space based on HTML structure. In The Ninth Text REtrieval Conference (TREC 9), pages 601---610, 2000.
|
 |
24
|
|
| |
25
|
S. Wiberley and W. G. Jones. Time and technology: A decade-long look at humanists' use of electronic information technology. College and Research Libraries, 61, September, pages 421--431, 2000.
|
|