jrms!

Jorge Manjarrez-Sanchez

Software Engineering Group
Center for Mathematical Research (CIMAT)

hello!
http://www.cimat.mx/~jorgems/

Recent research

Parallel Content-based Retrieval in Image Databases

I have addressed the performance problem when searching in large databases of images. The processing of similarity queries is a computational challenge because of the dimensionality of the abstract representation for the images and size of the databases. I developed two data organization methods that account for performance improvement. The first one is based on the clustering of the database in centralized settings. I have derived an optimal range of values for the number of clusters to obtain from a database, which in conjunction with a searching algorithm allows to efficiently process nearest neighbor queries. However as the dimensionality and size of the database increase, a single computer is overwhelmed. The second method is based on data partitioning over a shared nothing machine. Based on the results of the first method, this method maximizes parallelism. I have also derived the optimal number of processing nodes to maximize resource utilization. I have performed extensive experiments with synthetic and real databases. They validate the proposals and show that the performance level is superior to existing approaches which beyond a certain dimensionality or database size become inefficient.

Keywords: Multimedia data management, Multidimensional data, Databases, Data clustering, Cluster and parallel computing, Data partitioning.

While on Doctoral leave at the University of Nantes, under the guidance of Prof. Patrick Valduriez and Prof. Jose Martinez.

Publications (with Jose Martinez and Patrick Valduriez)

Previous Projects

Research

Development

Detailed cv available upon request.
jrms
tumblr site counter