Similarity analysis on government regulations
Source
International Conference on Knowledge Discovery and Data Mining
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Washington, D.C.
POSTER SESSION: Industrial/government track
Pages: 711 - 716
Year of Publication: 2003
ISBN: 1-58113-737-0
ABSTRACT
Government regulations are semi-structured text documents that are often voluminous, heavily cross-referenced between provisions, and sometimes ambiguous. Multiple sources of regulations lead to difficulties in both understanding and complying with all applicable codes. In this work, we propose a framework for regulation management and similarity analysis. An online repository for legal documents is created with the help of a text mining tool, and users can access regulatory documents either through the natural hierarchy of provisions or through a concept-based taxonomy generated by knowledge engineers. Our similarity analysis core identifies relevant provisions and brings them to the user's attention. It does so by utilizing both the hierarchical and referential structures of regulations to provide a better comparison between provisions. Preliminary results show that our system reveals hidden similarities between provisions that are not apparent from node content comparisons alone.
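The idea of refining a content-only comparison with structural information can be illustrated with a minimal sketch. The abstract does not give the paper's scoring formulas, so everything below is an assumption made for illustration: the bag-of-words cosine base score, the `alpha` blending weight, the best-match averaging over child provisions, and the sample provision texts and identifiers are all hypothetical. A referential variant could apply the same blending to the provisions that each node cites instead of its children.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def base_score(text_a: str, text_b: str) -> float:
    """Content-only comparison of two provisions (assumed: whitespace tokens)."""
    return cosine(Counter(text_a.lower().split()), Counter(text_b.lower().split()))


def refined_score(a, b, provisions, children, alpha=0.5):
    """Blend the content score of (a, b) with how well their children match.

    alpha is a hypothetical weight on the node's own content; the remainder
    goes to the mean best-match score of a's children against b's children.
    """
    s0 = base_score(provisions[a], provisions[b])
    kids_a, kids_b = children.get(a, []), children.get(b, [])
    if not kids_a or not kids_b:
        return s0  # leaf provisions fall back to content only
    best = [
        max(base_score(provisions[x], provisions[y]) for y in kids_b)
        for x in kids_a
    ]
    return alpha * s0 + (1 - alpha) * sum(best) / len(best)


# Hypothetical provisions from two different codes.
provisions = {
    "A": "accessible route requirements",
    "A.1": "ramp slope shall not exceed 1:12",
    "B": "means of egress provisions",
    "B.1": "ramp slope shall not exceed 1:12",
}
children = {"A": ["A.1"], "B": ["B.1"]}

print(base_score(provisions["A"], provisions["B"]))
print(refined_score("A", "B", provisions, children))
```

In this toy data the two parent provisions share no terms, so their content-only score is zero, while the refined score rises to about 0.5 because their child provisions are identical. This is the kind of hidden similarity that a pure node content comparison would miss.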
REFERENCES
[1] ADA Accessibility Guidelines for Buildings and Facilities. The Access Board, 1998.
[4] Kurt D. Bollacker, Steve Lawrence, C. Lee Giles, CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications, Proceedings of the second international conference on Autonomous agents, p.116-123, May 10-13, 1998, Minneapolis, Minnesota, United States. doi: 10.1145/280765.280786
[6] British Standard 8300. British Standards Institution (BSI), 2001.
[7] California Building Code. California Building Standards Commission, 1998.
[8] Jochen Dörre, Peter Gerstl, Roland Seiffert, Text mining: finding nuggets in mountains of textual data, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, p.398-401, August 15-18, 1999, San Diego, California, United States. doi: 10.1145/312129.312299
[9] Gibbens, M. P. California Disabled Accessibility Guidebook 2000. Builder's Book, Canoga Park, CA, 2000.
[11] International Building Code 2000. International Conference of Building Officials, 2000.
[12] Kidder, F. and Parker, H. Kidder-Parker Architects' and Builders' Handbook. John Wiley & Sons, London, UK, 1931.
[13] Mitra, P. and Wiederhold, G. Resolving terminological heterogeneity in ontologies. In Proceedings of Workshop on Ontologies and Semantic Interoperability at the 15th European Conference on Artificial Intelligence (ECAI) (Lyon, France, 2002).
[14] Porter, M. F. An algorithm for suffix stripping. Program: Automated Library and Information Systems, 14 (3). 130-137.
[17] Semio Tagger. Semio Corporation, 2002. http://www.semio.com.
[18] Uniform Federal Accessibility Standards (UFAS). The Access Board, 1986.
[19] Extensible Markup Language (XML). World Wide Web Consortium (W3C), 2003. http://www.w3.org/XML.
CITED BY 5
Kincho H. Law , Gio Wiederhold , Gloria T. Lau , Xiaoshan Pan , Haoyi Wang , Li Zhang, REGBASE: a distributed information infrastructure for regulation management and compliance checking, Proceedings of the 2004 annual national conference on Digital government research, p.1-2, May 24-26, 2004, Seattle, WA