Multilevel Representation and Query
Processing in Multimedia Database Systems
Arif Ghafoor (PI), Rangasami
L. Kashyap (Co-PI), Shankar Moni (Co-PI)
Purdue University,
West Lafayette, IN, 47907
Contact Information
Arif
Ghafoor
School of Electrical
and Computer Engineering, Purdue University, West Lafayette, IN, 47907
Phone: (765)
494-0638 Fax: (765) 494-3371
Email: ghafoor@ecn.purdue.edu
WWW
PAGE
Distributed
Multimedia Systems Laboratory (URL:
http://shay.ecn.purdue.edu/~dmultlab/nsf_proj/)
Supported Students
- Shu-Ching Chen (PhD completed in 12/1998)
- Wasfi Al-Khatib (Graduate Research Assistant)
- Srinivas Sista (Graduate Research Assistant)
Keywords
multimedia data representation,
fuzzy queries, spatio-temporal modeling, content-based retrieval, image
processing
Project Award Information
-
Project award number:
IRI-9619812
-
Name of project: Multilevel
Representation and Query Processing in Multimedia Database Systems
-
Duration of award: September
1997 to September 2000
-
Current award year: September
1998 to September 1999
Project Summary
The goal of this project
is to develop a multimedia database system with capabilities to handle
heterogeneous media queries. This system caters to the computational and
storage requirements while accommodating and exploiting the inevitable
semantic and representational imprecisions. The design of this system is
based on multilevel data models and search mechanisms. These methodologies
facilitate the users for posing various types of queries, including: (i)
low-level, such as finding objects in a multimedia database, (ii) mid-level,
based on spatio-temporal semantics, such as locating events associated
with multimedia data, and (iii) high-level, targeted towards searching
pre-composed multimedia documents, based on constituent mono-media data,
their spatio-temporal dimensions, and logical structure. The multi-level
search mechanisms are tightly interlinked. Imprecision in the search results
is modeled using a set of fuzzy parameters. The results of this research
are helping develop a comprehensive framework for building wide variety
of multimedia applications in commercial, educational, governmental and
military sectors.
Goals, Objectives,
and Targeted Activities
Our long term goals include the development
of a generic multi-level representation of multimedia data that can
overcome several challenges faced by the database community. In
particular, for low-level data modeling our research will focus on
evaluating the unsupervised classification approach for image
segmentation using a log-likelihood function. Two image segmentation
models, namely the facet model and the texture model, are being
evaluated in terms of computation and exactness of match. We are also
planning to develop and evaluate other models. For mid-level
multimedia data representation, we are planning to develop indexing,
Petri-net and neighborhood graph models for semantic modeling and
managing fuzziness in query formulation.
Indication of Success
So far the stated objectives
for this project have been met quite well, as demonstrated by the concrete
research results produced to date. Success of this project in
terms of quality research results has been aided by the collaboration between
two researchers with backgrounds in image processing and multimedia databases.
This collaboration has been instrumental in providing a sharp understanding
of research challenges astride these areas. As a result of this unique
collaboration, several research ideas and publications are being produced. The following research results has been obtained:
- We have developed a formulation for video segmentation and object
tracking. This formulation does not require the supervision of a human
user. Each frame in the video is partitioned into different segments
and the segments are combined to form object traces. We have provided an
algorithm that simultaneously partitions a video frame and obtains the
parameters of the underlying classes. The problem of partitioning each
frame is posed as a joint estimation of the partition and class
parameter variables. By incorporating the partition information of the
previous frame into the segmentation process of the current frame, our
method implicitly uses the temporal information. Experimental results have
shown that our method succeeds in capturing the object classes even
when the objects undergo translations and rotations not in the plane
of the image.
- We have proposed methods that serve as such spatio-temporal models
for event-based retrieval of video. We have presented description of
characteristics and identification of requirements for motion-based
video retrieval. Based on this discussion, we have proposed two
alternative and complementary schemes, trajectory and
trail-based models for representing the motion in
video. Trail-based models are proposed to handle temporal scale
(speed) invariant searches. A Mellin transform based scale invariant
pattern recognition technique has been used in our proposed algorithm
to perform scale invariant searches. Trajectory based models handle
most other types of motion-based searches in video data. Two
algorithms for handling spatial translation-absolute Match has been
proposed: A Fourier transform-based distance algorithm and a Two-stage
algorithm that reduces the complexity of the first algorithm. We
support the proposed motion representation models with effective and
efficient searching techniques. In addition to general requirements
such as flexibility and efficiency, these techniques categorically
address the invariance features in spatial and temporal domain.
- For the search case that does not require any invariance feature,
we have proposed computationally efficient searching techniques based
on common statistical methods and carried out an extensive
analysis. In addition to computational efficiency, these methods
provide flexibility to the user in determining the right search
parameters for optimum accuracy/performance tradeoff.
- User interface implementation: The proposed technique alleviates
the limitations of keyword-based search techniques and provides an
effective example-based query entry mechanism as part of the
implementation. A video motion indexing tool has been developed to
perform object tracking using a semi-automatic tracking tool for
capturing MBR's. A query tool has been developed to facilitate the
input of motion-based queries by sketch.
The project is expected to provide viable solutions for developing a
general framework for developing multimedia databases needed for a
broad range of applications. The NSF funding has provided an opportunity
for this collaboration which would have been difficult,
otherwise.
Project Impact and
Output
Two students are currently pursuing their doctoral studies as
part of this project. One student has completed his doctoral thesis in
this area. The research results of this project have been incorporated
in a graduate level course on multimedia systems (EE 624), which is
taught by the PI. The project and its planned implementation is a
cornerstone of the cutting-edge research being carried out in our
lab. We anticipate interest from industrial organizations as the
project matures and implementation-worthy results are
produced.
Project
References
[1] Shu-Ching
Chen and R. L. Kashyap, "A Spatial-Temporal Semantic Model for
Multimedia Presentations and Multimedia Database Systems" to appear in
IEEE Trans. on Knowledge and Data Engineering.
[2] S. Dagtas,
W. Al-Khatib, A. Ghafoor, and R. Kashyap, ``Models for Motion-based
Video Indexing and Retrieval'', Submitted to IEEE Transactions on
Image Processing
[3] S. Dagtas,
W. Al-Khatib, A. Khokhar, and A. Ghafoor, ``Trail-Based Approach for
Video Data Indexing and Retrieval'', Submitted to IEEE International
Conference on Multimedia Computing and Systems (ICMCS)
[4] Y. F. Day,
A. Khokhar, S. Dagtas, and A. Ghafoor, ``A Multi-level Abstraction and
Modeling in Video Databases'', to appear in ACM Journal on Multimedia
Systems.
[5] S. Sista and
R. L. Kashyap, "Unsupervised Video Segmentation and Object Tracking,"
submitted to IEEE International Conference on Image Processing, Kobe,
Japan, October 1999.
[6] Shu-Ching
Chen and R. L. Kashyap, "Empirical Studies of Multimedia Semantic
Models for Multimedia Presentations," 13th International Conference on
Computer and Their Applications, Honolulu, Hawaii, USA, March 25-27,
1998.
[7] Shu-Ching
Chen and R. L. Kashyap, "Temporal and Spatial Semantic Models for
Multimedia Presentations," 1997 International Symposium on Multimedia
Information Processing, Dec. 11-13, 1997.
Area Background
Our project is concerned
with data management and information retrieval technologies essential for
developing future multimedia systems. The emphasis of our research is on
developing an automated system to allow multi-level data representation
to assist query processing at different levels of abstraction. It will
allow users to access images and video data based on appearance of objects
as well as events surrounding these objects. The key tradeoff for users
is between the accuracy of matching and the computational cost of the query.
The framework developed for this project will provide solutions for challenging
problems in multimedia data organization and integration, indexing and
retrieval mechanisms, intelligent searching techniques, information browsing,
content-based query processing and so forth. A large variety of potential
applications will benefit from this framework.
Area References
There are several journals
and conferences which have excellent coverage of issues in this area. They
include: IEEE Multimedia; ACM Journal on Multimedia Systems; IEEE Trans.
on Knowledge and Data Engineering; ACM Multimedia Conference; IEEE Int. Conf.
on Multimedia Computing and Systems. Several special issues from IEEE Computer
and ACM Journal on Multimedia Systems have been specifically devoted to
this topic. Several industrial projects undertaken by IBM, Siemens, NEC,
Oracle, Fuji Electric Co., etc., are focused on this topic.
Potential Related
Projects
Within the NSF IDM program, several projects related to
multimedia data modeling and management are being conducted in UCLA,
Case Western Reserve University, University of Maryland, University of
Nevada, University of Illinois at Chicago, University of California at
Santa Barbara, University of Pittsburgh, University of Maine, and
University of Washington.