- PhD in Electrical and Computers Engineering, FEUP, May 2006
Title of the thesis: "Metadata Assisted Image Segmentation"
Abstract:
For over a century the process of capturing electronic images has
remained virtually unchanged, with each pixel in the image being a
discrete sample of the spatial and temporal continuum being
photographed. In a conventional camera, the only recorded information
for each pixel is position and colour. The fact is that captured
images remain very limited samples of the scene they represent, only
giving a 2D impression of the 3D spatial build-up of the scene. This
primitive process of capture is the cause of problems in a myriad of
applications, ranging from the film and television post-production
to virtual reality, or object-based video formats and
industrial inspection. Although much effort has been put into
surmounting these problems, all these approaches are based on the
estimation of data that is simply not included in the discrete samples
provided by digital images, and so are limited in the quality they can
provide. The capture of additional data is a step forward to address
these problems.
The study of enhanced image segmentation techniques is critical given
that image segmentation is an ubiquitous operation, spanning a large
set of applications, and that subsequent processes rely heavily on its
performance. The availability of additional data that is synchronous
with colour information should significantly boost the performance of
current state of the art techniques, fostering a whole class of new
applications.
This dissertation focuses on the study of image segmentation techniques
assisted by metadata. In particular, the use of depth and motion
information to improve the quality of segmentations of colour images is
investigated. Starting from primitive fusion approaches for colour and
depth, the work evolves to richer techniques, with the joint modelling
of colour and depth information; the the method is further
extended to conveniently handle the noise typical of real-life depth
data. Next, an alternative approach is presented, where depth is used
for providing a crude identification of the objects in the image and
colour is then used to attain high-quality borders, performing a guided
image segmentation starting from the crude seeds obtained from the
depth information. The study is concluded with the integration of
motion information in the most promising fusion techniques.
Because a fair judgment of any new image segmentation algorithm needs a
fair comparison metric, a preliminary study on metrics for comparing
image segmentations was conducted in the first place. The poor
performance of existing measures led to an exhaustive investigation on
new solutions, culminating on the rediscover of the metrics based on
the intersection-graph of two segmentations and on their introduction
to the image engineering community for the first time.
In the numerous experiments that are reported, experimental evidence of
the adequacy of the metrics and enhanced images segmentation techniques
is provided.
- MSc in Mathematical Engineering, FCUP, November 2005.
Title of the thesis: "Classification of Ordinal Data".
Abstract:
Predictive learning has traditionally been a standard inductive
learning, where different subproblem formulations have been identified.
One of the most representative is classification, consisting on the
estimation of a mapping from the feature space into a finite class
space. Depending on the cardinality of the finite class space we are
left with binary or multiclass classification problems. Finally, the
presence or absence or a “natural” order among classes will
separate nominal from ordinal problems.
Although two-class and nominal classification problems have been
dissected in the literature, the ordinal sibling has not yet received a
lot of attention, even with many learning problems involving
classifying examples into classes which have a natural order. Scenarios
in which it is natural to rank instances occur in many fields, such as
information retrieval, collaborative filtering, econometric modeling
and natural sciences.
Conventional methods for nominal classes or for regression problems
could be employed to solve ordinal data problems; however, the use of
techniques designed specifically for ordered classes yields simpler
classifiers, making it easier to interpret the factors that are being
used to discriminate among classes, and generalises better. Although
the ordinal formulation seems conceptually simpler than nominal, some
technical difficulties to incorporate in the algorithms this piece of
additional information – the order – may explain the
widespread use of conventional methods to tackle the ordinal data
problem. This dissertation addresses this void by proposing a
nonparametric procedure for the classification of ordinal data based on
the extension of the original dataset with additional variables,
reducing the classification task to the well-known two-class problem.
This framework unifies two well-known approaches for the classification
of ordinal categorical data, the minimum margin principle and the
generic approach by Frank and Hall. It also presents a probabilistic
interpretation for the neural network model. A second novel model, the
unimodal model, is also introduced and a parametric version is mapped
into neural networks. Several case studies are presented to assert the
validity of the proposed models.
- BSc (Licenciatura - 5 years degree) in Electrical and
Computers Engineering, specialization in Telecommunications, FEUP, July
1999.
Title of the final project: "Aplicação de hardware baseado em FPGA's ao processamento de imagens em tempo real".
Abstract:
A dedicated processor for block-matching motion estimation is presented.
It is based on a FPGA computing platform and uses a cumulant matching
criteria. Important performance gains have been achieved in comparison
with equivalent software implementations.
|