Posted by eduardovalle on Monday, June 21, 2010
Ana Lopes, a student of Prof. Arnaldo Araújo and Prof. Jussara de Almeida, has compiled an impressive survey on human action recognition for her Ph.D. thesis. The analysis of that corpus, especially the recent literature, has prompted us to propose a new way to categorize the existing methods, using the underlying data representation as the main criterium of organization. The abstract explains the rationale behind that choice:
This paper presents a survey of human action recognition approaches based on visual data recorded from a single video camera. We propose an organizing framework which puts in evidence the evolution of the area, with techniques moving from heavily constrained motion capture scenarios towards more challenging, realistic, “in the wild” videos. The proposed organization is based on the representation used as input for the recognition task, emphasizing the hypothesis assumed and thus, the constraints imposed on the type of video that each technique is able to address. Expliciting the hypothesis and constraints makes the framework particularly useful to select a method, given an application. Another advantage of the proposed organization is that it allows categorizing newest approaches seamlessly with traditional ones, while providing an insightful perspective of the evolution of the action recognition task up to now. That perspective is the basis for the discussion in the end of the paper, where we also present the main open issues in the area.
The survey was submitted for peer review at the CVIU, and is available as a preprint at arxiv.org.
Posted in science | Tagged: action recognition, Ana Lopes, Arnaldo Araújo, CVIU, DCC / UFMG, human actions, Jussara de Almeida, paper, publication, survey | Leave a Comment »
Posted by eduardovalle on Wednesday, April 21, 2010
Our paper, “MONORAIL: A Disk-Friendly Index for Huge Descriptor Databases” was accepted at the upcoming IAPR Internation Conference on Pattern Recognition — ICPR 2010. Here is the abstract:
We propose MONORAIL, an indexing scheme for very large multimedia descriptor databases. Our index is based on the Hilbert curve, which is able to map the high-dimensional space of those descriptors to a single dimension. Instead of using several curves to mitigate boundary effects, we use a single curve with several surrogate points for each descriptor. Thus, we are able to reduce the random accesses to the bare minimum. In a rigorous empirical comparison with another method based on multiple surrogates, ours shows a significant improvement, due to our careful choice of the surrogate points.
I am particularly proud of this paper, not only because of the method itself, but also because of the experimental design we propose for the validation. I have been studying for more than a year the topics of Design of Experiments, statistical tests and validation. This is the first of a crop of publications that are employing those rigorous evaluation tools, which, though commonplace in other fields, are still seldom used in Computer Sciences.
Posted in publications, science | Tagged: conference, design of experiments, Fernando Akune, kNN search, monorail, paper, publication, Ricardo Torres | Leave a Comment »
Posted by eduardovalle on Wednesday, April 14, 2010
I have just arrived (suitcases still to be undone) from my trip to the USA. This time, I went to Philadelphia for the MIR Conference, where I have presented a poster on the work of my student Fábio Faria. I have met many interesting people at MIR and heard exciting, new ideas from them, but (without any intention to dismiss the hard work of the organizers) I must confess I was expecting a more diverse array of works (especially considering how broad the “Multimedia” community is).
Instead, I was astonished by how much the presented selection was similar in terms of technical foundation: classification based on discriminant approach (almost always using SVM) and representation based on “bags of visual features”. It is not that those do not interest me — after all, our own work is sits squarely on those pillars — but I was very interested in hearing about, seeing other approaches: generative models based on latent or explicit semantics, representations based on constellation models — what do I know ? — perhaps something completely new, which I haven’t even heard about.
I was left wondering why those “competing theories” were so notably absent. Has the community decided that SVM + Bags of Features is so conspicuously better than everything else ? (If that is the case, I would like to know how they reached this conclusion — though I like the results given by the pair “bags + SVM”, I am far from considering the “case closed”).
Was it self-selection by the autors, who didn’t submit their works to this particularly community ?
Or — and this is obviously the worst scenario— have all the alternative works been retained at the peer review barrier, because ideological considerations have (maybe unconsciously?) tainted the assessment of quality. I would like to quick dismiss this latter possibility, but the similarity between the works was really astounding. My student Otávio Penatti, who is on his first months of Ph.D. (he was there presenting a demo of his M.Sc. work) remarked it immediately.
I was very glad, nevertheless, to have this opportunity to visit Philadelphia. It was a very moving experience for me, because it gave me a very concrete, very immediate realization of how strongly The Enlightenment was shining in America at that time.
* * *
Otávio and I have profited from our travel to the USA to visit Prof. Edward Fox in Virginia Tech, who was the former Ph.D. advisor of Otávio’s current Ph.D. avidsor and my Post-Doc advisor Prof. Ricardo Torres. We have an ongoing cooperation with Prof. Fox. In fact, while we were there, we have met a Brazilian colleague of ours, Nadia Kozievitch, who is spending an year of her Ph.D. with Prof. Fox.
While we were there, we gave a talk on our current work and got acquainted with several exciting projects Prof. Fox is conducting, on a broad array of applications of digital libraries, including identification of fingerprints, biodiversity databases, e-Science, cooperation for crisis situations, and education.
We have also met Brazilian Prof. João Setúbal, who showed us the Virginia Bioinformatics Institute, and talked about his work in genomics, and the new field of transcriptonics.
We were very impressed not only with the infra-structure of Virginia Tech, but also with the kindness and attentiveness of everyone who received us.
Posted in career, science | Tagged: paper, publication, cooperation, UNICAMP, Ricardo Torres, Fábio Faria, USA, Edward Fox, João Setúbal, Otávio Penatti, Nadia Kozievitch, Virginia Tech, bioinformatics, MIR 2010 | 2 Comments »
Posted by eduardovalle on Friday, March 5, 2010
Our new lab RECOD now has not only a cool name and logo but also money to finance its first two years of operation. We have just been informed that the Brazilian sponsoring agency FAPESP has approved our project. The project is coordinated by my post-doc supervisor Prof. Ricardo Torres and was co-authored by Prof. Anderson Rocha, Prof. Helio Pedrini, Prof. Jacques Wainer, Prof. João Cavalcanti, Prof. Siome Goldenstein and me. Prof. Pedrini (our colleague from UNICAMP) and Prof. Cavalcanti (from UFAM) entered not as members of the lab, but as cooperating partners in the project.
Posted in science | Leave a Comment »
Posted by eduardovalle on Tuesday, February 16, 2010
Together with Prof. Anderson Rocha, Prof. Jacques Wainer, Prof. Ricardo Torres (my Post Doc advisor, by the way) and Prof. Siome Goldenstein, we have recently founded a new laboratory at the Computing Institute of the State University of Campinas (UNICAMP).
The new lab — which we named RECOD — aims to embrace the research subjects of machine learning, multimedia retrieval and classification, multimodality and digital forensics.
The foundation of this new lab both celebrates a history of fruitful colaboration between its participating members and inaugurates a new phase of tighter cooperation, in which the synergy of our complementary competencies will be fostered in an optimized environment.
I cannot avoid to be proud that my colleagues have accepted both my name and logo suggestions for the new lab.
Long live RECOD !

Posted in science | Tagged: Anderson Rocha, digital forensics, Jacques Wainer, machine learning, multimedia, RECOD, Ricardo Torres, Siome Goldenstein, UNICAMP | Leave a Comment »
Posted by eduardovalle on Friday, January 22, 2010
Our paper, “Learning to Rank for Content-Based Image Retrieval” , was accepted at the upcoming ACM Multimedia Information Retrieval Conference (MIR 2010). The first author is the M.Sc. student Fábio Faria, and the paper was co-authored with my Post Doc supervisor Ricardo Torres and several of our partners from UFMG, including Marcos Gonçalves, with whom we have an ongoing cooperation.
Here is the abstract:
“In Content-based Image Retrieval (CBIR), accurately ranking the returned images is of paramount importance, since users consider mostly the topmost results. The typical ranking strategy used by many CBIR systems is to employ image content descriptors, so that returned images that are most similar to the query image are placed higher in the rank. While this strategy is well accepted and widely used, improved results may be obtained by combining multiple image descriptors. In this paper we explore this idea, and introduce algorithms that learn to combine information coming from different descriptors. The proposed learning to rank algorithms are based on three diverse learning techniques: Support Vector Machines (CBIR-SVM), Genetic Programming (CBIR-GP), and Association Rules (CBIR-AR). Eighteen image content descriptors (color, texture, and shape information) are used as input and provided as training to the learning algorithms. We performed a systematic evaluation involving two complex and heterogeneous image databases (Corel e Caltech) and two evaluation measures (Precision and MAP). The empirical results show that all learning algorithms provide significant gains when compared to the typical ranking strategy in which descriptors are used in isolation. We concluded that, in general, CBIR-AR and CBIR-GP outperforms CBIR-SVM. A fine-grained analysis revealed the lack of correlation between the results provided by CBIR-AR and the results provided by the other two algorithms, which indicates the opportunity of an advantageous hybrid approach.”
I will be travelling to Philadelphia on late March to present the poster. I am very excited about this upcoming trip to the United States, where I am to meet several friends and colleagues, but at the same time, worried about the radicalization of air security rules and the exaggeration of perception of threats. Have we got so scared to die that we decided instead not to live ?
Posted in publications, science | Tagged: paper, conference, publication, cbir, DCC / UFMG, Ricardo Torres, poster, MIR, learn to rank, machine learning, Fábio Faria, Marcos Gonçalves | Leave a Comment »
Posted by eduardovalle on Monday, January 11, 2010
While we both enjoy our Summer breaks in our hometown, my good friend Bruno Abrahão, who is now a Ph.D. candidate at Cornell University, asked me if he could, as a preparation for an upcoming exam, teach me a lesson on Advanced Programming Languages. I eagerly accepted, not only for the opportunity of spending time with him, but also for the enjoyment of learning something new.
My last proper lesson on P.L. was on my undergrads, probably more than ten years ago. Certainly, it was nothing like the one he taught me: his course covered extremely abstract topics, mostly concerning deep formal issues, like how the semantics of P.L. can be constructed. We have started with lambda calculus, and from that innocent foundation, erected a progressively complex conceptual lattice…
I don’t know if he had planned a self-contained lesson, but as it turned out, we just greedily browsed the syllabus, stopping at everything that looked noteworthy. He would ask me “let me know when you get tired” and I would answer “not yet, please, go on”. We visited a lot of material, throwing haphazardly away everything whose “interest / length of exposition” ratio seemed unpromising.
When we finally became aware of the time, four hours had passed, and we were so exhausted we could barely say our goodbyes in a proper manner.
But I am looking forward to do this again and pondering on how can I lure each one of my friends into teaching me a lesson. Maybe some Earl Gray and my famous banana cupcakes could do the trick…
Posted in learning | Tagged: Bruno Abrahão, lambda calculus, learning, programming languages | Leave a Comment »
Posted by eduardovalle on Wednesday, November 11, 2009
- I am visiting the Mathematical and Computing Sciences Instiute of the University of São Paulo (ICMC – USP) at São Carlos, on Tuesday, November 24. I’ll visit Professors Agma Traina and Caetano Traina Jr. and their team — they have a long experience on similarity search, and have proposed the Slim-Tree, a metric space technique. I’ll give a talk on the Scalability Issues of Multimedia Information Retrieval.
- That same week, I am visiting the Mathematics and Statistics Institute of the University of São Paulo (IME – USP) at São Paulo, on Friday, November 27. I’ll visit Professor Roberto Cesar Jr. and his student, the Ph.D. candidate Marcelo Hashimoto, both of whom I had the pleasure to meet on last SIBGRAPI. I’ll also give a talk there, the title will be announced soon. (EDIT 26/11: The talk will be about “Similarity search for Multimedia” and will be heavily based on my SBBD 2009 Tutorial — more details can be found on my Google Calendar).
- The slides of my (and Prof. Matthieu Cord’s) Tutorial on Similarity Search and High-Dimensional Indexing, which I gave at this year’s SBBD, are finally available.
- I am still working to make available the slides of my (and Prof. Matthieu Cord’s) Tutorial on Advanced Techniques for CBIR, which I gave at this year’s SIBGRAPI. The companion article to the tutorial, however, is already available.
Posted in publications | Tagged: kNN search, publication, cbir, seminar, sibgrapi, tutorial, SBBD, icmc-usp, ime-usp, similarity search | Leave a Comment »
Posted by eduardovalle on Sunday, September 20, 2009
I guess that for all people involved, DocEng’09 was a success. Like last year, the conference was small — I think that we were 60 or 70 participants — but the quality of the works presented was high, and the scientific exchange was extremely interesting. In DocEng, you get to meet everyone individually, something which is unfeasible at large-scale conferences.
Thematically, the conference has a broad scope, centered around the representation, processing, analysis, storage and retrieval of documents. My main research topic concerns the retrieval of multimedia documents, and is somewhat at the fringe of the conference theme. Nevertheless, people seemed genuinely interested and I’ve got many useful insights and suggestions.
* * *
I have just arrived at Paris, where I will meet my former Ph.D. supervisor Prof. Matthieu Cord, among other colleagues. I intend to advance our research on high-dimensional multimedia indexing and large scale multimedia retrieval. I am also giving a talk about my current research pursuits at the ETIS labs, on Cergy-Pontoise, next Tuesday, September 22nd.
If you use Google Calendar you can save the date by clicking below:

Posted in publications, science | Tagged: cooperation, doceng, etis, France, Germany, lip6, publication, seminar | Leave a Comment »
Posted by eduardovalle on Wednesday, September 9, 2009
My tutorial Similarity Search and Indexing for High-Dimensional Data has been accepted on SBBD 2009 (The Brazilian Symposium on Databases). Here’s the abstract:
Searching by similarity is a critical operation on many systems, and thus has attracted the attention of many disciplines in Computer Sciences, including Computational Geometry, Machine Learning, Multimedia and, of course, Databases. To perform efficiently, similarity search requires the support of indexing, which suffers from the infamous “curse of the dimensionality”. In this tutorial we will introduce the challenges of indexing and searching high-dimensional data, and present the most recent tools available to “tame the curse”. At the end, the audience will have a good grasp of the current state of the art, the most promising research trends and the challenges still faced by the technology.
The tutorials, as I understand, are open to all participants on the conference. Mine will be held on Wednesday, October 7th from 14h40 to 18h20, with a 20′ coffee-break. If you use Google calendar, you can save the date by clicking on the button below.

* * *
I’ve unintentionally let an awful lot of of time pass since my last post — the move to Campinas (and to UNICAMP) has been wonderful, but also laborious. I thought that after moving across countries three times, moving across states would be a piece of cake, but it seems that, no matter the distance, moving is always a lot of hassle!
EDIT 11/11/09: The tutorial presentation, for the moment without narrative, is available on my talks and courses page.
Posted in career, publications | Tagged: indexing, kNN search, publication, SBBD, tutorial, UNICAMP | Leave a Comment »