About me

I started out in mechanical engineering, then moved on to study applied math, statistics and computer science. I've been analysing large-scale datasets for 6 years. Mostly genomics, but also financial, geographical and text mining data.

The above is my 3-monitor setup at the Sanger Institute, snapped by a colleague as I was doing statistical analysis dealing with some hostile aliens in Mass Effect:Andromeda. As you can see, my desktop is on the messy side. The Coke bottle is not mine, but the flags are.

The journey so far



I performed geographical information (GIS) analyses using R, in order to map the evolution of several types of biodiversity-critical habitats in an administrative region of the Republic of Congo.

April-June 2019

📰Article (Bioinformatics)

Very low depth whole genome sequencing in complex trait association studies.

View on publisher's website

December 2018

Started working as Head of Analytics

Institute of Translational Genomics, Helmholtz-Zentrum Munich, Germany

November 2018


Completed my PhD

You can read my thesis, entitled Sequencing in Isolation: Next-generation sequencing studies in founder populations on the University's online repository. Good luck!

October 2018

📰Article (Nature Communications)

Cohort-wide deep whole genome sequencing and the allelic architecture of complex traits..

View on publisher's website

November 2018


Created the Sounds of Paris website

Paris, France

May 2018

Helped organise and run the 2nd Volos Summer School of Human Genetics

Volos, Greece

April-June 2018

📰Article (Nature Communications)

Whole genome sequencing and imputation in isolated populations identify genetic associations with medically-relevant complex traits.

View on publisher's website

November 2017

Student Consultant

Cambridge Consulting Network

Along with a group of 5 other students, I acted as a consultant for a healthcare charity. We developed models for the health economics and social return on investment of Obsessive-Compulsive Disorder (OCD), and produced a report and recommendations.

April-June 2017

Helped organise and run the 1st Volos Summer School of Human Genetics

Volos, Greece

April-June 2017

📰Article (Human Molecular Genetics)

Very low-depth sequencing in a founder population identifies a cardioprotective APOC3 signal missed by genome-wide imputation.

View on publisher's website

May 2016

Started working as Principal Bioinformatician

The Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK

Skills used: Python (pandas, plotly, bokeh, seaborn), R, bash (sed/awk), REST/JSON

June 2016

Started a PhD

Department of Public Health and Primary Care, University of Cambridge

April 2016

Ecohack Cambridge

UNEP-WCMC, Cambridge, UK

Ecohack is a global hackathon organized by UNEP-WCMC, an environmental consulting charity linked to the UN. During the event, environmentalists, analysts and policy consultants meet to work on .

I designed the project website and built a prototype layered map in Umap.

Nov 2014

📰Article (Nature Communications)

Genetic characterization of Greek population isolates reveals strong genetic drift at missense and trait-associated variants

View on publisher's website

Nov 2014

Cambridge Big Biology Day

Hills Road Sixth Form College, Cambridge, UK

The Big Biology Days are an initiative by the Society of Biology to bring together researchers from the life sciences and young students, as well as the broader public. The goal is to introduce children to the basic concepts of biology through play activities.

I animated the DNA sequencing, Sorting Algorithm and DNA alphabet activities.

October 2014

📰Article (Briefings in Functional Genomics)

Using population isolates in genetic association studies.

View on publisher's website

Nov 2014

Started working as Senior Bioinformatician / Statistical Geneticist

The Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK

Skills used: Bash, Perl, C++ (boost libraries), Python (scipy libraries), R, Tableau

Statistics used: Logistic regression, PCA/MDS, Linear Mixed Models, hypothesis testing, clustering (k-means), methods for sparse data (imputation)

August 2013

Crossed the Channel to Cambridge, UK 🏡☔

August 2013

Article (BMC Bioinformatics)

TE-Tracker: Systematic identification of transposition events through whole-genome resequencing

March 2013 View on publisher's website

Ran statistics courses at CEA/Genoscope

Topics covered: Introduction to statistics, estimation theory, hypothesis testing, regression models, (M)AN(C)OVA, model building.</br></br> Fortnightly 2-hour sessions and practicals using R.

Autumn/Winter 2012

Started working as Research Engineer

CEA/Genoscope, Evry, Paris, France

Skills used: Bash, Perl, C++ (boost libraries)

Statistics used: clustering (single-linkage), supervised classification

January 2012

Started working as R&D Engineer (short-term contract)

Misys Sophis, Paris, France

Skills used: C#

Statistics used: Monte-Carlo methods, Stochastic processes, derivatives valuation models

August 2011

MSc in Applied Mathematics 🎓

Department of Mathematical Modelling, Image and Simulation, Grenoble INP-ENSIMAG, Grenoble, France

August 2011

Started working as Junior Front Office Consultant (intern)

Misys Sophis, Hong Kong SAR, P.R. of China 🇭🇰

Skills used: C#, Excel, SQL

January 2011

Exchange semester

Department of Bio and Brain Engineering, KAIST, Daejeon, South Korea 🇰🇷

August-December 2010

Bachelor in Engineering 🎓

Grenoble INP-ENSIMAG, Grenoble, France

August 2010

Born 👶♉

Toulouse, France

April 1989