On document splitting in passage detection
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Singapore, Singapore
POSTER SESSION: Posters group 4: theory and IR models
Pages 833-834  
Year of Publication: 2008
ISBN:978-1-60558-164-4
Authors
Nazli Goharian  Illinois Institute of Technology, Chicago, IL, USA
Saket S.R. Mengle  Illinois Institute of Technology, Chicago, IL, USA
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 78,   Citation Count: 0
DOI: http://doi.acm.org/10.1145/1390334.1390528

ABSTRACT

Passages can be hidden within a text to circumvent restrictions on their transfer. Such release of compartmentalized information is of concern to all corporate and governmental organizations. We explore a methodology for detecting such hidden passages within a document. A document is divided into passages using various document splitting techniques, and a text classifier is used to categorize the resulting passages. We present a novel document splitting technique called dynamic windowing, which significantly improves precision, recall, and F1 measure.
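
The split-then-classify pipeline described in the abstract can be illustrated with a minimal sketch. This is not the dynamic windowing technique, whose details are not given here; it uses a plain fixed-size word window as a stand-in splitter and a toy keyword-count classifier as a stand-in for a real text classifier. All function names, parameters, and keyword lists are hypothetical, and the sketch assumes a hidden passage is one whose predicted category differs from the category of the enclosing document.

```python
from collections import Counter


def split_into_windows(text, window_size=5, step=5):
    """Split text into passages of `window_size` words, advancing `step` words.

    Assumption: a fixed-size word window, used only as a stand-in splitter.
    """
    words = text.split()
    if not words:
        return []
    passages = []
    for start in range(0, len(words), step):
        passages.append(" ".join(words[start:start + window_size]))
        if start + window_size >= len(words):
            break  # the last window already reaches the end of the text
    return passages


def classify_passage(passage, category_keywords):
    """Toy classifier: pick the category with the most keyword hits, or None."""
    counts = Counter(word.strip(".,;:!?").lower() for word in passage.split())
    scores = {
        category: sum(counts[keyword] for keyword in keywords)
        for category, keywords in category_keywords.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def detect_hidden_passages(text, document_category, category_keywords,
                           window_size=5, step=5):
    """Return (passage, category) pairs whose category differs from the document's."""
    flagged = []
    for passage in split_into_windows(text, window_size, step):
        category = classify_passage(passage, category_keywords)
        if category is not None and category != document_category:
            flagged.append((passage, category))
    return flagged


if __name__ == "__main__":
    # Hypothetical categories and keywords, for illustration only.
    keywords = {
        "sports": {"game", "team", "score"},
        "finance": {"revenue", "merger", "profit"},
    }
    text = ("The team won the game. Quarterly revenue and merger profit "
            "leaked. The final score was close.")
    for passage, category in detect_hidden_passages(text, "sports", keywords):
        print(category, "|", passage)
```

Setting `step` smaller than `window_size` produces overlapping windows, which is one simple way to reduce the chance that a hidden passage is cut in half at a window boundary.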


