|
ABSTRACT
Because of the media digitization, a large amount of information such as speech, audio and video data is produced everyday. In order to retrieve data quickly and precisely from these databases, multimedia technologies for organizing and retrieving of speech, audio and video data are strongly required. In this paper, we overview the multimedia technologies such as organization and retrieval of speech, audio and video data, speaker indexing, audio summarization and cross media retrieval existing today. The main purpose of the organization is to produce tables of contents and indices from audio and video data automatically. In order to make these technologies feasible, first, processing units such as words on audio data and shots on video data are extracted. On a second step, they are meaningfully integrated into topics. Furthermore, the units extracted from different types of media are integrated for higher functions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
Dharanipragada, S., Franz. M., McCarley, J.S., Roukos, S. and Ward. T., "Story Segmentation and Topic Detection for Recognized Speech," Proc. of Eurospeech99, pp.2435-2438, 1999.
|
| |
3
|
Walls, F., Jim H., Sista. S. and Schwartz, R., "Topic Detection in Broadcast News." Proc. of Eurospeech99, pp.2451-2454, 1999.
|
| |
4
|
|
| |
5
|
Siegler, M.A., "Experiments in Spoken Document Retrieval at CMU." TRECT, 1998.
|
| |
6
|
Gey, F.C., Jiang, H., Chen, A. and Larson, R.R., "Manual Queries and Machine Translation in Cross-Language Retrieval and Interactive Retrieval with Cheshire II at TREC-7." Proc. of TREC-7, pp527-540, 1997.
|
| |
7
|
|
| |
8
|
Delacourt, P., Kryze, D. and Vqellekens, J.C. "Detection of Speaker Changes in an Audio Document," Proc. Eurospeech'99, pp.1195-1198, 1999.
|
| |
9
|
Linde, Y., Buzo, A. and Gray, R.M., "'An algorithm for vector quantizer design," IEEE Trans. Commun., COM-28, pp.84-95. 1980.
|
| |
10
|
Kiyama. J., Itoh. Y. and Oka, R., "Automatic Detection of Topic Boundary and Keywords in Arbitrary Speech Using Incremental Reference Interval-free Continuous DP," Proc. of ICSLP96: pp. 1946-1949, 1996.
|
| |
11
|
Ariki. Y. and Saito, Y., "'Extraction of TV News Articles based on Scene Cut Detection Using DCT Clustering," Proc. of ICIP96. pp. III847-III850, 1996.
|
| |
12
|
Smith, M.A. and Kanade, T., "'Video skimming and characterization through the combination of image and language understanding technique, " CMU-CS-97-111, 1997.
|
| |
13
|
|
| |
14
|
Muller, S., Eickeler, S., Rigoll, G., "Pseudo 3-D HMMs for Image Sequence Recognition," Proc. of ICIP99. 28APl.11, 1999.
|
| |
15
|
Kurokawa. M., Echigo, T., Tomita,A., Maeda, J., Miyamori. H. and Iisaku. S., "Representation and Retrieval of Video Scene by Using Object Actions and Their Spatio-Temporal Relationships," Proc. of {CIP99. 26AO2.1, 1999.
|
| |
16
|
Yaginuma, Y. and Sakauchi, Y., "Content-based Retrieval and Decomposition of TV Drama based on Inter-media Synchronization," First Int. Conf. on Visual Information Systems, 1996.
|
| |
17
|
|
| |
18
|
|
|