Comprehensive List of Researchers "Information Knowledge"
Department of Media Science
- Name: IDE, Ichiro
- Group: Speech and Image Science Group
- Title: Associate Professor
- Degree: Dr. of Engineering
- Research Field: Multimedia contents processing
Current Research
Contents Understanding of Large-scale Broadcast Video Data Collection
1.Overview
Various media for acquiring and distributing video streams provide our society with a huge amount of video data. These data need to be analyzed and exploited to support a wide range of human activities. However, conventional content analysis and pattern recognition methods are difficult to apply to large-scale collections. We are therefore working to support efficient and effective understanding of video contents by giving users retrieval and browsing capabilities backed by knowledge extracted from a large-scale video data collection.
2.Research theme
Among various genres, we focus on broadcast news video, which can be considered a common media heritage of human society that records its activities in detail. These video data, however, were long in danger of being lost, until fairly recent efforts began to store them in archives. Even now, methods and tools for retrieving specific video data or extracting valuable information from massive amounts of video data are still under research.
To address these problems, we have been working with several hundred hours of broadcast news video on the following tasks :
・Extracting news topic thread structures
A “topic thread” structure, which represents both the chronological and the semantic relations between news stories, is automatically extracted in order to track the development of a news topic in focus. A browsing interface built on this structure lets users understand the topic in focus efficiently and effectively.
・Assembling monologue collections
The advantage of news video over text (newspapers and magazines) or audio (radio) is that it contains nonverbal visual information, e.g. the facial expressions and mood of the person in a monologue scene such as a speech or an interview. We are thus working on automatically detecting monologue scenes by referring to multimodal cues in the video data.
・Extracting human relationships in news
We are working on extracting human relationships from news stories based on the co-occurrence of news subjects, and on using this knowledge to better understand news contents.
In addition to these developments, we are also interested in pattern recognition methods in image, text, and audio data to support them.
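As a rough illustration of the co-occurrence idea behind the human-relationship task above, the following minimal sketch counts how often pairs of people appear in the same news story. It is not the method used in our work: the function name cooccurrence_counts, the input format (one list of person names per story), and the placeholder names are all assumptions made for this example, and it presumes that person names have already been extracted from each story.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(stories):
    """Count the number of stories in which each pair of people appears together.

    `stories` is a hypothetical input format: one list of person names
    per news story, assumed to have been extracted beforehand.
    """
    counts = Counter()
    for people in stories:
        # Deduplicate and sort so that (A, B) and (B, A) map to the same pair.
        for pair in combinations(sorted(set(people)), 2):
            counts[pair] += 1
    return counts


# Placeholder data for illustration only.
stories = [
    ["Person A", "Person B"],
    ["Person A", "Person B", "Person C"],
    ["Person C", "Person A"],
]

relations = cooccurrence_counts(stories)
for (a, b), n in relations.most_common():
    print(f"{a} - {b}: {n}")
```

Pairs with higher counts can be read as more strongly related within the collection. A practical system would probably also need to normalize for how often each person appears overall.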
3.Future works
Systems that respond promptly to a user’s query, such as automatic documentary generation on a specified topic, are also under consideration.
4.Other works
We have been working on other video data besides news video :
・Video data from car-mounted cameras : Driver support, Creation of a cityscape database
・Cooking shows : Indexing cooking steps, Application for cooking support
・Soccer shows : Tracking of players on the field
・Soap operas : Indexing of characters’ activities
Apart from work in this field, we have also worked on the following theme :
・Designing an infrastructure for academic information distribution : Designing a researcher's personal information management system
Figure : (Top) Example of a thread structure, (Bottom) The topic tracking interface
Career
- 2000 Dr. Eng., The Univ. of Tokyo
- 2000 Assistant Prof., National Institute of Informatics
- 2002 Assistant Prof., Graduate Univ. of Advanced Studies (concurrent)
- 2004 Associate Prof., Nagoya Univ. ; Visiting Associate Prof., National Institute of Informatics (concurrent) (-2010)
- 2010 Senior Visiting Researcher, University of Amsterdam Institute of Informatics (-2011)
Academic Societies
- IEEE
- ACM
- IEICE
- IPS Japan
- JSAI
- ITE
- The Association for Natural Language Processing
Publications
- I. Ide, T. Kinoshita, T. Takahashi, H. Mo, N. Katayama, S. Satoh, H. Murase: "Efficient tracking of news topics based on chronological semantic structures in a large-scale news video archive", IEICE Trans. on Information and Systems, E95-D(5), 1288-1300, 2012.
- I. Ide, R. Hamada, S. Sakai, H. Tanaka: "Semantic analysis of television news captions referring to suffixes", Proc. 4th Intl. Workshop on Information Retrieval in Asian Languages, 37-42, 1999.
- I. Ide, R. Hamada, S. Sakai, H. Tanaka: "Identification of scenes in news video from image features of background region", Proc. 1st Intl. Workshop on Multimedia Information Storage and Retrieval Management, 1999.
- F. Nack, I. Ide: "Why did the Prime Minister resign? -Generation of event explanation from large news repositories-", Proc. 19th ACM Int. Multimedia Conf. (ACM-MM2011), 313-322, 2011.