Multilevel Representation and Query
Processing in Multimedia Database Systems
Arif Ghafoor (PI), Rangasami
L. Kashyap (Co-PI), Shankar Moni (Co-PI)
Purdue University,
West Lafayette, IN, 47907
Contact Information
Arif
Ghafoor
School of Electrical
and Computer Engineering, Purdue University, West Lafayette, IN, 47907
Phone: (765)
494-0638 Fax: (765) 494-3371
Email: ghafoor@ecn.purdue.edu
WWW
PAGE
Distributed
Multimedia Systems Laboratory (URL:
http://shay.ecn.purdue.edu/~dmultlab/nsf_proj/project.html)
Supported Students
- Srinivas Sista (PhD Completed in 5/1999)
- Wasfi Al-Khatib (PhD Candidate 25%)
- Husni Fahmi (Graduate Research Assistant)
- Irfan Khan (Graduate Research Assistant)
- Basit Shafiq (Graduate Research Assistant)
Keywords
multimedia data representation,
fuzzy queries, spatio-temporal modeling, content-based retrieval, image
processing
Project Award Information
-
Project award number:
IRI-9619812
-
Name of project: Multilevel
Representation and Query Processing in Multimedia Database Systems
-
Duration of award: September
1997 to September 2000
-
Current award year: September
1999 to September 2000
Project Summary
The goal of this project
is to develop a multimedia database system with capabilities to handle
heterogeneous media queries. This system caters to the computational and
storage requirements while accommodating and exploiting the inevitable
semantic and representational imprecisions. The design of this system is
based on multilevel data models and search mechanisms. These methodologies
facilitate the users for posing various types of queries, including: (i)
low-level, such as finding objects in a multimedia database, (ii) mid-level,
based on spatio-temporal semantics, such as locating events associated
with multimedia data, and (iii) high-level, targeted towards searching
pre-composed multimedia documents, based on constituent mono-media data,
their spatio-temporal dimensions, and logical structure. The multi-level
search mechanisms are tightly interlinked. Imprecision in the search results
is modeled using a set of fuzzy parameters. The results of this research
are helping develop a comprehensive framework for building wide variety
of multimedia applications in commercial, educational, governmental and
military sectors.
Goals, Objectives,
and Targeted Activities
Our long term goals include the development
of a generic multi-level representation of multimedia data that can
overcome several challenges faced by the database community. In
particular, for low-level data modeling our research will focus on
evaluating the unsupervised classification approach for image
segmentation using a log-likelihood function. Two image segmentation
models, namely the facet model and the texture model, are being
evaluated in terms of computation and exactness of match. We are also
planning to develop and evaluate other models. For mid-level
multimedia data representation, we are planning to develop indexing,
Petri-net and neighborhood graph models for semantic modeling and
managing fuzziness in query formulation.
Indication of Success
So far the stated objectives
for this project have been met quite well, as demonstrated by the concrete
research results produced to date. Success of this project in
terms of quality research results has been aided by the collaboration between
two researchers with backgrounds in image processing and multimedia databases.
This collaboration has been instrumental in providing a sharp understanding
of research challenges astride these areas. As a result of this unique
collaboration, several research ideas and publications are being produced. The following research results has been obtained:
- We have developed a conceptual model for multimedia documents that
incorporates space, time, and content information. In order to model
the spatio-temporal relationships among multimedia objects in a
multimedia document, the OCPN modeling scheme proposed by one of the
PI's has been enhanced and utilized as a multimedia document model.
- Based on the enhanced OCPN-based model, we have developed a database
architecture and are currently developing a prototype system. The
architecture of the system has the following components:
- An authoring tool for multimedia document creation
- An OCPN script generator, translator, and an automatic deadline
generator from OCPN scripts.
- A query processor allowing the user to store, query and retrieve
multimedia documents from the underlying multimedia databases.
- A presentation manager responsible for scheduling the events and actions
to play back and synchronize multimedia objects during document
presentation.
The prototype system is being implemented using an object oriented DBMS
as an underlying database management system along with C++ and X/Motif
Graphical User Interface.
- We have developed a semantic model that support content-based retrieval
and modeling for video databases. The modeling approach uses multi-level
representation of video data using hierarchical Petri-nets (HPNs). This
model captures video data semantics, from very low level semantics such as
scene change and object movements to higher level semantics involving
high-level textual description of events.
- We have developed another semantic representation of video data using a
coordinate valued neighborhood graph (CVNG) model. CVNG is an extension
to the neighborhood graph model developed by Nabil et. al. to compute
similarity between still images. In video, the spatial relationship
between two objects over a sequence of frames is captured using a symbolic
representation of a video sequence. We utilize the techniques reported
in nsf-report year 2 in query processing which provide computationally
efficient techniques to compare complex video scenes.
The project continues to provide viable solutions for developing a
general framework for developing multimedia databases needed for a
broad range of applications. The NSF funding has provided an opportunity
for this collaboration which would have been difficult,
otherwise.
Project Impact and
Output
Two students are currently pursuing their doctoral studies as
part of this project. One student has completed his doctoral thesis in
this area. The research results of this project have been incorporated
in a graduate level course on multimedia systems (EE 624), which is
taught by the PI. The project and its planned implementation is a
cornerstone of the cutting-edge research being carried out in our
lab. We anticipate interest from industrial organizations as the
project matures and implementation-worthy results are
produced.
Project
References
[1] Shu-Ching
Chen and R. L. Kashyap, "A Spatial-Temporal Semantic Model for
Multimedia Presentations and Multimedia Database Systems"
to appear in IEEE Trans. on Knowledge & Data Engineering.
[2] S. Dagtas,
W. Al-Khatib, A. Ghafoor, and R. L. Kashyap, ``Models for Motion-based
Video Indexing and Retrieval'', IEEE Trans. on Image Processing, 9(1),
pp. 88-101, January 2000
[3] W. Al-Khatib and A.
Ghafoor, ``An Approach for Video Meta-Data Modeling and Query Processing",
in the proceedings of the seventh ACM Multimedia International Conference,
pp. 215-224, Orlando, Florida, October 30-November 5, 1999.
[4] S. Dagtas and A.
Ghafoor, ``Indexing and Retrieval of Video based on Spatial Relation Sequences",
in the proceedings of the seventh ACM Multimedia International Conference,
Part 2, pp. 119-122, Orlando, Florida, October 30-November 5, 1999.
[5] Y. F. Day,
A. Khokhar, S. Dagtas, and A. Ghafoor, ``A Multi-level Abstraction and
Modeling in Video Databases'', Multimedia Systems, 7(5), pp. 409-423,
Septemeber 1999.
[6] S. Dagtas,
W. Al-Khatib, A. Khokhar, and A. Ghafoor, ``Trail-Based Approach for
Video Data Indexing and Retrieval'', In proceedings of the IEEE International
Conference on Multimedia Computing and Systems (ICMCS), Vol II, pp. 235-239,
Florence, Italy, June, 1999
[7] W. Al-Khatib, Y. F.
Day, A. Ghafoor, and P. B. Berra, ``Semantic Modeling and Knowledge
Representation in Multimedia Databases'', IEEE Trans. on Knowledge & Data
Engineering, 11(1), pp. 64-80, January-February, 1999
[8] S. Sista and
R. L. Kashyap, "Unsupervised Video Segmentation and Object Tracking,"
In proceedings of the IEEE International Conference on Image Processing, Kobe,
Japan, October 1999.
[9] Shu-Ching
Chen and R. L. Kashyap, "Empirical Studies of Multimedia Semantic
Models for Multimedia Presentations," 13th International Conference on
Computer and Their Applications, Honolulu, Hawaii, USA, March 25-27,
1998.
[10] Shu-Ching
Chen and R. L. Kashyap, "Temporal and Spatial Semantic Models for
Multimedia Presentations," 1997 International Symposium on Multimedia
Information Processing, Dec. 11-13, 1997.
Area Background
Our project is concerned
with data management and information retrieval technologies essential for
developing future multimedia systems. The emphasis of our research is on
developing an automated system to allow multi-level data representation
to assist query processing at different levels of abstraction. It will
allow users to access images and video data based on appearance of objects
as well as events surrounding these objects. The key tradeoff for users
is between the accuracy of matching and the computational cost of the query.
The framework developed for this project will provide solutions for challenging
problems in multimedia data organization and integration, indexing and
retrieval mechanisms, intelligent searching techniques, information browsing,
content-based query processing and so forth. A large variety of potential
applications will benefit from this framework.
Area References
There are several journals
and conferences which have excellent coverage of issues in this area. They
include: IEEE Multimedia; ACM Journal on Multimedia Systems; IEEE Trans.
on Knowledge and Data Engineering; ACM Multimedia Conference; IEEE Int. Conf.
on Multimedia Computing and Systems. Several special issues from IEEE Computer
and ACM Journal on Multimedia Systems have been specifically devoted to
this topic. Several industrial projects undertaken by IBM, Siemens, NEC,
Oracle, Fuji Electric Co., etc., are focused on this topic.
Potential Related
Projects
Within the NSF IDM program, several projects related to
multimedia data modeling and management are being conducted in UCLA,
Case Western Reserve University, University of Maryland, University of
Nevada, University of Illinois at Chicago, University of California at
Santa Barbara, University of Pittsburgh, University of Maine, and
University of Washington.