Degrees and Education
Nov. 2001 - May 2006
|
Doctorate in Computing Science, NLP / Document Retrieval
LIA, Université d'Avignon et des Pays de Vaucluse (UAPV), France
"Apprentissage et Décision Automatique en Recherche Documentaire: prédiction de difficulté de requêtes et sélection de modèle de recherche"
(Automatic learning and decision in document retrieval: prediction of query difficulty and retrieval model selection)
Doctorate thesis defended on May 31st, 2006
|
2000 - 2001
|
DEA d'Informatique (Master's Degree)
Faculté des Sciences de Luminy, Université d'Aix-Marseille II, France
|
1998 - 2000
|
Vordiplom Informatik (Undergraduate Degree in Computing Science)
Technische Universität Berlin, Germany
|
1990 - 1997
|
Secondary education (high school) and Abitur (graduation)
«Askanisches Gymnasium», Berlin, Germany
|
1994 - 1995
|
High School -- Junior Year
«American Fork High School», Utah, USA
|
Professional / Research Experience
February 2024 - ...
|
Senior Research Engineer at the Barcelona Supercomputing Center in Barcelona, Spain.
Applied Research in the Language Technologies group:
- Leveraging Large Language Models for real-world applications
- Adaptation / fine-tuning of LLMs
- Participation in European Projects
|
March 2017 - January 2024
|
Researcher at the Universitat Pompeu Fabra (TALN) in Barcelona, Spain.
Applied Research in the TALN group on a variety of NLP tasks:
- Natural Language Analysis (UIMA)
- Machine Translation
- Natural Language Generation
Head of technical infrastructure and development:
- Transition to a unified UIMA-based architecture across projects
- Implementation of Docker Swarm based service management
- Management of the group's technical infrastructure in coordination with UPF's IT services
Technical lead and coordinator of TALN's tasks in multiple EU projects (and other funding schemes):
- coordinator: TENSOR, Connexions, V4Design, Ingenious, XR4DRAMA, TecSemHu
- technical lead and researcher: beAWARE, Mindspaces, WELCOME, ALLIES, ReSilence
- contributor to proposal writing on various projects
|
Dec. 2013 - Apr. 2017
|
Researcher at the Universitat Pompeu Fabra (GLiCom) in Barcelona, Spain.
Applied Research in the GLiCom group on the topic of information extraction, retrieval and data mining:
- Natural Language Processing (UIMA)
- Semantic Search (UIMA + Solr)
- recommender systems (news, social media):
- hybrid systems: audio, social, and textual information
- analysis of social networks
Technical lead on the EUMSSI project (FP7-ICT 611057), evaluated as "excellent" by the European Commission:
- Design and implementation of a scalable platform for multilayered multimedia analysis
- Technical coordination of development across consortium partners
|
April 2008 - Nov. 2013
|
Researcher at the Fundació Barcelona Media (Universitat Pompeu Fabra) in Barcelona, Spain.
Applied Research on the topic of information retrieval / data mining:
- recommender systems (music, interactive services, personalized offers):
- hybrid systems: audio, social, and textual information
- integrating collaborative filtering with user demographics and item features (extracted using NLP)
- analysis of social networks
- user profiling
- Natural Language Processing (UIMA)
- Semantic Search (UIMA + Solr)
|
Oct. 2006 - Dec. 2007
|
Research and Development at MTG
In the Music Information Retrieval (MIR) group of the Music Technology Group (MTG) of the Universitat Pompeu Fabra (UPF) in Barcelona, Spain.
Applied Research on the topic of music information retrieval (work conducted under NDA):
- a fuzzy matching algorithm for the fusion of music-related information sources
- music recommendation systems
- playlist generation, genre classification, mood classification, labelling
- hybrid systems: audio, social, and textual information
- automatic tagging of music-related content based on Wikipedia, social community, and web-based information sources
- task-oriented performance evaluation of MIR systems
Most of the work relates to several European Projects, such as Variazioni, Pharos, Cantata, and others
|
2001 - 2006
|
Doctorate thesis in computing science
In the «Dialogue» research group at the Laboratoire Informatique d'Avignon (LIA), University of Avignon (UAPV), under the supervision of Renato de Mori (Prof. LIA-UAPV) and Pierre Jourlin (MdC. LIA-UAPV). Defended on May 31st, 2006.
Research on the topic of information retrieval:
- continuation of the master's thesis on predicting query difficulty
- application of automatic learning and classification methods for the evaluation of document retrieval system performance
- an automatic decision model for the optimization of retrieval strategies
|
2001 - 2004
|
Industrial partnership with Digitech S.A., as part of the doctorate thesis
|
February - June 2001
|
Master's thesis research project at the Laboratoire d'Informatique d'Avignon (LIA), «Évaluation et Prédiction des Difficultés de Requêtes dans la Recherche Documentaire pour l'Optimisation de Systèmes Interactifs» (Evaluation and Prediction of Query Difficulty in Document Retrieval for the Optimization of Interactive Systems)
|
Oct. 1999 - Sept. 2000
|
Tutor in Computing Science at the TU Berlin, see below for details
|
April - Dec. 1999
|
Part-time employment as a systems administrator at «Berliner Glas KG»
- transition from dial-up to permanent Internet connectivity, transfer of DNS domains previously maintained externally
- planning and implementation of a security concept for the internal network
- installation and maintenance of different servers (mail, web, samba, proxy, ...)
- user support
|
1997 - 1998
|
Civilian/social service (alternative to the otherwise mandatory military service) at the St. Joseph hospital in Berlin.
|
Teaching
Jan. - March 2016
|
Professor associat (Assistant professor)
Teaching activities at the Universitat Pompeu Fabra (UPF):
-
Anàlisi lingüistica i corpus
3rd year. Corpus linguistics
|
Jan. - Sept. 2009
|
Professor associat (Assistant professor)
Teaching activities at the Universitat Pompeu Fabra (UPF):
-
Programmació 1/2/3
1st year. Programming and algorithms (Java)
|
2004 - 2006
|
Attaché Temporaire d'Enseignement et de Recherche (ATER) (Assistant researcher/teacher)
Teaching activities at the IUP Génie Mathématique et Informatique:
-
Programming project
3rd year (Licence). Implementation of a functional language interpreter in C++
-
Information markup and structuring: XML
4th year (Mastère). VoiceXML, SMIL, Java (DOM, SAX), ...
-
Document indexing
4th year (Mastère). Document retrieval, indexing, compression, ...
-
OOP, Java / C#
4th year (Mastère). GUI programming, object models
|
2001 - 2004
|
Moniteur de l'Enseignement Supérieur (Tutor)
Teaching activities at the IUP Génie Mathématique et Informatique and the University of Avignon (UAPV):
-
Programming and algorithms in C++
2nd year (IUP 1). Algorithms, C++ practice, and introduction to object-oriented programming
-
Theoretical foundations of computing science
1st year (DEUG MIAS). Boolean algebras (formal approach)
-
Information structuring languages
3rd year (IUP 2). XML, XHTML, ...
-
Structure of Computers
2nd year (IUP 1). Boolean algebra, logic circuits, functioning of a simple computer.
|
1999 - 2000
|
Tutor in computing science
Teaching at the Technische Universität Berlin:
-
Functional programming
1st year. Programming in the functional language Opal
-
Hardware
1st year. From elementary electronic circuits to a "von Neumann" computer, binary Boolean algebra
-
Algorithms
1st year. Algorithms in Java
|
Computing skills
-
Programming Languages: Java, Python, C/C++, Perl, Opal, Pascal, Modula, Eiffel, Prolog, ...
-
Automatic classification and learning tools: gaia (MTG), WEKA, Mahout, D2K/M2K, SVM light, CAL5, ...
-
Document retrieval systems: Lucene/Solr, LIA's probabilistic system (Okapi based)
-
Natural Language Processing: UIMA, NLTK, OpenNLP, ...
-
Software development methods: object modeling (UML), design patterns, functional programming, ...
-
Database systems: MongoDB, Oracle, MySQL, ...
-
Operating systems: Unix/Linux, MacOS, TOS, Windows, ...
-
Office tools: LaTeX, OpenOffice, AppleWorks, ...
Languages
-
German: native
-
French: native
-
English: excellent (read, written, spoken)
-
Spanish: excellent (read, written, spoken)
-
Catalan: fluent (read, listening), basic (spoken, written)
-
Portuguese (Brazilian): advanced (read), basic (spoken, written)
|