1. Introduction
The QIMERA¹ project was initiated in March 2002 by a group of researchers sharing a common interest in video object segmentation and tracking. They decided to form a voluntary (i.e. non-funded) project among themselves and to invite participation from other researchers working in the field. The project currently consists of five members who collaborate remotely via email.
The objective of the QIMERA project is to develop a flexible modular software architecture for video object segmentation and tracking, facilitating multiple configurations of analysis algorithms and supporting user interaction when necessary. The goal is to
develop a system into which individual analysis tools can be easily integrated in order to
test their efficiency/accuracy. The system should not be tied to one particular
segmentation algorithm, but rather should be configurable depending on the type of
segmentation problem to be addressed and the analysis tools available. An architecture for
such a system has been proposed and an initial software implementation of the key components is available. A segmentation algorithm that uses the available components has been developed.

¹ The name QIMERA is derived from the Spanish word ‘quimera’ (‘chimera’ in English), meaning a fantastic fabrication of the mind, especially an unrealistic dream. Project members felt this to be a suitable description for the very challenging problem of segmentation. Qimera web page: http://www.qimera.org
In this paper we present a high-level overview of the system, a description of the first algorithm implemented, and some sample results. It should be noted that the algorithm described is the first attempt to actually use the QIMERA platform and, as such, it is quite crude.
2. System Overview
The QIMERA platform consists of a graphical user interface (GUI) and a system core. The system is designed so that the GUI and the core can be decoupled. In this way, the core could run on one platform whilst the GUI runs on a different (remote) platform, exchanging messages as sketched below.
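The paper does not specify how the two components communicate, but one way to realise the decoupling is for the core to expose a small message interface over a network socket. The following is a minimal sketch under that assumption; the command name, port and JSON message format are purely illustrative and are not taken from the QIMERA implementation.

```python
# Minimal sketch of a decoupled core: a JSON-over-TCP request/reply loop that
# a remote GUI could connect to. Command names, port and message format are
# hypothetical illustrations.
import json
import socket

HOST, PORT = "0.0.0.0", 5000  # assumed values


def handle_request(request: dict) -> dict:
    """Dispatch a GUI request to the appropriate core operation (stub)."""
    if request.get("command") == "segment_frame":
        # A real core would invoke the configured Analysis Modules here.
        return {"status": "ok", "frame": request.get("frame"), "mask": None}
    return {"status": "error", "reason": "unknown command"}


def run_core() -> None:
    """Run the core as a simple request/reply server; the GUI connects remotely."""
    with socket.create_server((HOST, PORT)) as server:
        conn, _ = server.accept()
        with conn:
            data = conn.recv(65536)
            reply = handle_request(json.loads(data.decode("utf-8")))
            conn.sendall(json.dumps(reply).encode("utf-8"))
```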
The system core consists of a set of configurable Analysis Modules communicating
with the GUI and each other. The Analysis Modules are designed to group different
approaches to specific image/video analysis tasks. The idea is that, for example, many
different approaches to colour segmentation could be grouped in a single Colour Analysis
Module. Thus, each Analysis Module can consist of one or more individual analysis tools
that could either work together or in competition with each other. In order to produce an
output object segmentation, the results of a number of different Analysis Modules need
to be combined – e.g. combining the results of independent colour segmentation and
motion segmentation processes. The task of scheduling when results should be combined
and actually carrying out the inference process is performed by the Inference Engine. This
structure is illustrated by a screenshot of the System Configuration Interface in Figure 1.
Each module depicted in Figure 1 has two components: a GUI for user interaction (setting of the initial parameters) and a processing component that implements the module. A brief description of each module is presented below; a sketch of a possible module interface follows the list:
• input (output) module – assists the user in the selection of the input video sequence
(output results)
• initial mask module – assists the user in the grouping of the segmented regions in the
initial frame to form an initial object segmentation mask
• colour segmentation module – segments each frame into uniform colour regions
• motion estimation module – estimates the motion of regions/objects
• inference engine – combines the results of other modules
• mask correction module – allows the user to correct the object mask, if required, at
any frame during the tracking stage
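To make the modular design more concrete, the following is a minimal sketch of the kind of interface a processing component might expose to the inference engine. The class and method names, and the use of Python, are assumptions for illustration only and are not taken from the QIMERA implementation.

```python
# Hypothetical sketch of a common Analysis Module interface; all names are
# illustrative and do not come from the QIMERA code.
from abc import ABC, abstractmethod
from typing import Any, Dict, Optional

import numpy as np


class AnalysisModule(ABC):
    """Base class grouping one or more analysis tools behind a single interface."""

    def __init__(self, parameters: Optional[Dict[str, Any]] = None) -> None:
        # Initial parameters would normally be set through the module's GUI component.
        self.parameters = parameters or {}

    @abstractmethod
    def process(self, frame: np.ndarray, state: Dict[str, Any]) -> Dict[str, Any]:
        """Analyse one frame and return results for the inference engine to combine."""


class ColourSegmentationModule(AnalysisModule):
    """Example module: would wrap one or more colour segmentation tools."""

    def process(self, frame: np.ndarray, state: Dict[str, Any]) -> Dict[str, Any]:
        # A real implementation would run a tool such as the modified RSST
        # algorithm described below; a dummy label image is returned here.
        region_labels = np.zeros(frame.shape[:2], dtype=np.int32)
        return {"regions": region_labels}
```

With such an interface the inference engine only needs to know the result keys produced by each module, so individual analysis tools can be swapped without changing the rest of the system.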
In order to instantiate an initial version of the platform, individual analysis tools were developed and integrated into the relevant modules as outlined in Figure 1. Using these analysis tools, an approach to semi-automatic segmentation was developed. We describe this approach in the following; it should be noted that it is only representative of the type of approach that could be developed with this system.
Once the initial mask is constructed, the foreground object is segmented and updated for each frame in the video sequence in the following manner, similar to the approach outlined in [2] (a sketch of this per-frame loop is given after the list):
• Each frame is segmented into uniform regions.
• For each region in the current frame a two-parameter motion projection is estimated.
• The regions that are projected inside the previous foreground mask are included in the foreground mask for the current frame. The regions for which the backward projection extends outside the previous foreground mask are labeled as outliers, and their association with foreground or background is investigated further.
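As a sketch of this per-frame loop, the fragment below assumes the colour segmentation and motion estimation steps are supplied as functions and that the two-parameter motion model is a simple translation; the helper names are hypothetical placeholders, not the actual QIMERA implementation.

```python
# Hypothetical sketch of the per-frame tracking loop; segment_colour_regions
# and estimate_translation are placeholders for the modules described below.
import numpy as np


def track_sequence(frames, initial_mask, segment_colour_regions, estimate_translation):
    """Propagate a binary foreground mask through a sequence of frames."""
    prev_mask = initial_mask.astype(bool)
    masks = [prev_mask]

    for prev_frame, frame in zip(frames[:-1], frames[1:]):
        regions = segment_colour_regions(frame)        # label image of uniform colour regions
        mask = np.zeros_like(prev_mask)
        outliers = []

        for label in np.unique(regions):
            region = regions == label
            # Two-parameter (translational) motion estimate for this region.
            dy, dx = estimate_translation(frame, prev_frame, region)
            projected = np.roll(region, (int(dy), int(dx)), axis=(0, 1))

            overlap = np.logical_and(projected, prev_mask).sum() / projected.sum()
            if overlap == 1.0:
                mask |= region            # projected entirely inside: foreground
            elif overlap > 0.0:
                outliers.append(label)    # partially inside: resolved by the inference engine

        prev_mask = mask                  # outlier resolution omitted here (see below)
        masks.append(mask)
    return masks
```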
The details of the modules that implement the above algorithm are described in the
following.
The colour segmentation module partitions each image in the video sequence into uniform colour regions. For this we use a modified Recursive Shortest Spanning Tree (RSST) algorithm [3]. Our modification is based on the observation that this approach sometimes merges regions with very different colours because of the strong penalty for joining large regions (originally introduced to improve the spatial continuity of the final regions). The problem is particularly visible when a small number of final regions is desired. To make the segmentation more robust to small changes in illumination due to shadows, etc., we use the HSV colour space. The algorithm operates in two stages:
• The first stage is identical to the original RSST algorithm. It iteratively merges regions (two regions per iteration) according to a distance calculated from the colour features and region size. The process stops when the desired number of regions is obtained. Because of the problem outlined above, the specified number of regions should not be smaller than 255.
• The second stage continues to merge regions, but a new formula is used to calculate the distance between two regions:

d(r_i, r_j) = (1/4) · |saturation_i − saturation_j| + (3/4) · |hue_i − hue_j|    (1)

where d(r_i, r_j) is the distance between regions i and j. The above formula does not discourage large regions (since spatial continuity was obtained in the first stage). The hue differences are computed modulo 360°. The region merging stops when all distances exceed a predefined threshold.
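For illustration, the stage-two distance of equation (1) might be implemented as follows; representing each region by its mean hue (in degrees) and mean saturation is an assumption made here for the sketch, not a detail given in the paper.

```python
# Hypothetical sketch of the stage-two distance of equation (1). Each region
# is assumed to be summarised by its mean saturation and mean hue (degrees).
def hue_difference(hue_i: float, hue_j: float) -> float:
    """Absolute hue difference computed modulo 360 degrees."""
    diff = abs(hue_i - hue_j) % 360.0
    return min(diff, 360.0 - diff)


def region_distance(sat_i: float, hue_i: float, sat_j: float, hue_j: float) -> float:
    """Equation (1): weighted sum of the saturation and (modular) hue differences."""
    return 0.25 * abs(sat_i - sat_j) + 0.75 * hue_difference(hue_i, hue_j)


# Example: two regions with similar hue (10° vs 355°) but different saturation.
# region_distance(0.2, 10.0, 0.8, 355.0) -> 0.25*0.6 + 0.75*15.0 = 11.4
```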
The user-drawn scribbles classify the regions they intersect into background and foreground respectively. Not all the regions in the initial image are intersected by one of the scribbles²; these regions are therefore not explicitly assigned to background or foreground during user interaction. The unclassified regions are labeled iteratively. For each unlabeled region ur, the distance to each of the available objects (foreground/background) is calculated as:

d_ur(l) = min[d(ur, lr)] / Σ d(ur, lr)    (2)

where d_ur(l) is the distance from the unlabeled region ur to the label l, min[d(ur, lr)] is the minimum distance from ur to a region carrying label l, and the sum in the denominator runs over all labeled regions lr.
² It is generally desirable to keep user interaction to a minimum.
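A minimal sketch of this labeling step is given below; the distance d between two regions is assumed to be the colour distance of equation (1), the data structures are illustrative, and the single-pass loop simplifies the iterative labeling described above.

```python
# Hypothetical sketch of equation (2) and the labeling of unclassified regions.
# `labeled_regions` is assumed to be a list of (region, label) pairs and `d`
# the colour distance of equation (1); both assumptions are illustrative.
def label_distance(ur, labeled_regions, label, d):
    """Equation (2): minimum distance from ur to regions with `label`,
    normalised by the sum of distances from ur to all labeled regions."""
    to_label = [d(ur, lr) for lr, lbl in labeled_regions if lbl == label]
    to_all = [d(ur, lr) for lr, _ in labeled_regions]
    return min(to_label) / sum(to_all)


def classify_unlabeled(unlabeled, labeled_regions, d):
    """Assign each unlabeled region to the closer of foreground/background
    (a single pass is shown; the paper labels regions iteratively)."""
    assignments = []
    for ur in unlabeled:
        fg = label_distance(ur, labeled_regions, "foreground", d)
        bg = label_distance(ur, labeled_regions, "background", d)
        assignments.append((ur, "foreground" if fg < bg else "background"))
    return assignments
```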
5
The inference engine assigns the background/foreground labels to the current image regions based on the results of the motion estimation module. The decision rules, sketched after the list, are the following:
• The regions projected completely inside the previous frame's foreground mask are assigned to the foreground.
• For regions that are only partially projected inside the previous frame's foreground mask (outlier regions), the pixels inside and outside the mask, respectively, are identified, and the colour distance between each partition and the surrounding regions is computed according to equation (2). The ratio of the two distances is computed, and the region is classified as background or foreground depending on whether the ratio exceeds a given threshold, which is used to define the robustness of the tracking procedure.
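The fragment below sketches this outlier rule; the direction of the comparison and the default threshold are assumptions, since the paper only states that the ratio is compared against a threshold.

```python
# Hypothetical sketch of the outlier-resolution rule. d_inside and d_outside
# stand for the equation (2) distances computed on the pixels of the region
# falling inside and outside the previous mask; the comparison direction and
# threshold value are assumptions, not taken from the paper.
def classify_outlier(d_inside: float, d_outside: float, threshold: float = 1.0) -> str:
    """Assign a partially projected (outlier) region to foreground or background.

    A small d_inside relative to d_outside suggests the part projected inside
    the previous mask matches its surroundings better, so the region is kept
    in the foreground; otherwise it is assigned to the background."""
    ratio = d_inside / d_outside if d_outside > 0 else float("inf")
    return "foreground" if ratio < threshold else "background"
```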
The tracking results obtained with the algorithm described in the previous section are
illustrated in Figure 3. The results indicate that the algorithm is a suitable starting point
for work on supervised object segmentation and tracking. Future work on this particular
algorithm will focus on developing a fully automatic mode of the algorithm, refining the
process of motion estimation and investigating a statistical approach to combining
individual segmentation results to produce a final object segmentation.
Future work in the QIMERA project itself will focus on collaborative work on a
more sophisticated inference engine, integrating additional analysis tools, adding
segmentation evaluation metrics and providing an enhanced interface for module
communication and integration.
Figure 3. Segmentation results for the MPEG-4 test sequences: “Foreman”, “Mother
and daughter”, “Table tennis”, at frames 1, 14, 23.
Acknowledgments
This material is based upon work supported by the IST programme of the EU in the
project IST-2000-32795 SCHEMA. The support of the Informatics Research Initiative of
Enterprise Ireland is gratefully acknowledged.
References
[1] N. O’Connor, S. Marlow, “Supervised semantic object segmentation and tracking via EM-based estimation of mixture density parameters”, Proceedings NMBIA’98 (Springer-Verlag), pp. 121-126, Glasgow, July 1998.
[2] F. Marques, B. Marcotegui, F. Meyer, “Tracking areas of interest for content-based functionalities in segmentation-based video schemes”, IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 1224-1227, May 1996.
[3] E. Tuncel, L. Onural, “Utilization of the recursive shortest spanning tree algorithm for video-object segmentation by 2-D affine motion modelling”, IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 5, August 2000.