Education and Academic positions
- 2017 : Junior Member of Institut Universitaire de France (IUF), promotion 2017
- 2016 : Habilitation à Diriger des Recherches [abstract and full PDF]
- 2012- Present : Associate Professor (Maitre de Conference) at Ecole normale supérieure
- 2009 - 2012 : Postdoc Fellow
- Research team : Department of Computational Molecular Biology (Gene Regulation Group) headed by Martin Vingron, within Max Planck Institute for Molecular Genetics, Berlin Germany.
2008 - 2009 : Postdoc
- Research team : Laboratoire de Bioinformatique des Génomes et Réseaux (BiGRe) - (Genomes and Networks Bioinformatics) headed by Jacques van Helden, Brussels, Belgium.
- 2004 - 2008 : PhD in Bioinformatics
Medina-Rivera A*, Defrance M*, Sand O* et al, NAR, 2015
Thomas-Chollier M et al, Nature Protocols, 2012
Thomas-Chollier M et al, NAR, 2012
Thomas-Chollier M et al, NAR, 2011
Thomas-Chollier M*, Sand O* et al, NAR, 2008
Medina-Rivera A, Abreu-Goodger C, Thomas-Chollier M et al, NAR, 2011
Sand O, Thomas-Chollier M, van Helden J.Bioinformatics, 2009
Sand O, Thomas-Chollier M et al. Nature Protocols, 2008
Thomas-Chollier M*, Turatsinze J-V* et al. Nature Protocols, 2008
Thomas-Chollier M et Ledent V, BMC Genomics, 2008
Thomas-Chollier M et al, BMC Bioinformatics, 2007
My projects often involve predictions of binding regions for transcription factors (motif detection, de-novo motif discovery, ChIP-seq, ChIP-exo) and I am an active developper of RSAT.
I also have a strong interest for the evo-devo field, in particular the evolution of Hox and ParaHox protein sequences accross metazoans. My PhD thesis was focused on the Hox genes.
I analyze next-generation sequencing data. Programs to analyze these data are changing in parallel with the fast improvements of sequencing technologies, making the work of a bioinformatician very dynamic ! On a technical level, I am interested by the technology of Web services.
Peer-reviewed international journals* = equal contributions
# = co-corresponding authors
- Nguyen NTT*, Contreras-Moreira B*, Castro-Mondragon JA, Santana-Garcia W, Ossio R, Robles-Espinoza CD, Bahin M, Collombet S, Vincens P, Thieffry D, van Helden J#, Medina-Rivera A#, Thomas-Chollier M#. "RSAT 2018: regulatory sequence analysis tools 20th anniversary",
Nucleic Acid Research, in press (2018) [Pubmed][Full text]
- Thierion E, Le Men J, Collombet S, Hernandez C, Coulpier F, Torbey P, Thomas-Chollier M, Noordermeer D, Charnay P, Gilardi-Hebenstreit P. "Krox20 hindbrain regulation incorporates multiple modes of cooperation between cis-acting elements",
PLoS Genet, 13(7) p. e1006903 (2017) [Pubmed][Full text]
- Castro-Mondragon JA, Jaeger S, Thieffry D, Thomas-Chollier M#, van Helden J#. "RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.",
Nucleic Acid Research, in press (2017) [bioRxiv][Pubmed][Full text]
- Love MI, Huska MR, Jurk M, Schöpflin R, Starick SR, Schwahn K, Cooper SB, Yamamoto KR, Thomas-Chollier M, Vingron M, Meijsing SH. "Role of the chromatin landscape and sequence in determining cell type-specific genomic glucocorticoid receptor binding and gene regulation.",
Nucleic Acid Research, 45:1805-1819 (2016) [Pubmed][Full text]
- Schöne S, Jurk M, Helabad MB, Dror I, Lebars I, Kieffer B, Imhof B, Rohs R, Vingron M, Thomas-Chollier M#, and Meijsing SH#. "Sequences flanking the core binding site modulate glucocorticoid receptor structure and activity",
Nature Communications, 7: 12621 (2016) [Pubmed][Full text]
- Telorac J, Prykhozhij SV, Schöne S, Meierhofer D, Sauer S, Thomas-Chollier M#, Meijsing SH#. "Identification and characterization of DNA sequences that prevent glucocorticoid receptor binding to nearby response elements",
Nucleic Acid Research, 44(13):6142-6156 (2016) [Pubmed][Full text]
- Hossan T, Nagarajan S, Baumgart SJ, Xie W, Tirado Magallanes R, Hernandez C, Chiaroni P, Indenbirken D, Spitzner M, Thomas-Chollier M, Grade M, Thieffry D, Grundhoff A, Wegwitz F, Johnsen SA. "The Histone Chaperone SSRP1 is Essential for Wnt Signaling Pathway Activity During Osteoblast Differentiation",
Stem Cells 34(5):1369-76 (2016) [Full text]
- Thomas-Chollier M, Martinez P. "The origin of metazoan patterning systems and the role of ANTP-class homeobox genes".
eLS, John Wiley Sons Ltd, Chichester.(2016) [Full text]
- Medina-Rivera A*, Defrance M*, Sand O*, Herrmann C, Castro-Mondragon J, Delerce J, Jaeger S, Blanchet C, Vincens P, Caron C, Staines DM, Contreras-Moreira B, Artufel M, Charbonnier – Khamvongsa L, Hernandez C, Thieffry D, Thomas-Chollier M#, van Helden J# "RSAT 2015 : Regulatory Sequence Analysis Tools",
Nucleic Acid Research 43(W1):W50-W56 (2015) [Pubmed][Full text]
- Starick S*, Ibn-Salem J*, Jurk M*, Hernandez C, Love MI, Chung H, Vingron M, Thomas-Chollier M#, Meijsing SH#
"ChIP-exo signal associated with DNA-binding motifs provide insights into the genomic binding of the glucocorticoid receptor and cooperating transcription factors" ,
Genome Research 25(6):825-35 (2015) [Pubmed][Full text]
- Hudry B, Thomas-Chollier M, Volovik Y , Duffraisse M, Dard A, Dale F, Technau U, Merabet S
"Molecular insights into the origin of the Hox-TALE patterning System",
eLife 3:e01939 (2014) [Pubmed][Full text]
- Thomas-Chollier M*, Watson L* , Cooper S, Pufall MA, Liu JS, Borzym K, Vingron M, K.R Yamamoto, SH Meijsing
"A naturally occurring single amino acid insertion rewires transcriptional regulation by Glucocorticoid receptor isoforms",
Proc. Natl. Acad. Sci. U. S. A. 110(44):17826-31 (2013) [Pubmed][Full text]
- Thomas-Chollier M, Darbo E, Herrman C, Defrance M, Thieffry D, van Helden J.
"A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs",
Nature Protocols 7, 1551-1568 (2012) [Pubmed][Full text]
- Thomas-Chollier M, Herrman C, Defrance M, Sand O, Thieffry D, van Helden J.
"RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets",
Nucleic Acids Research 40(4) (2012) [Pubmed][Full text]
- Thomas-Chollier M, Hufton A, Heining M, O'Keeffe S, El Masri N, Roider HG, Manke T, Vingron M.
"Transcription factor binding predictions using TRAP for the analysis of ChIP-seq data and regulatory SNPs",
Nature Protocols 6, 1860-69 (2011) [Pubmed]
- Thomas-Chollier M, Defrance M, Medina-Rivera A, Sand O, Herrman C, Thieffry D, van Helden J.
"RSAT 2011: Regulatory Sequence Analysis Tools",
Nucleic Acids Research 39(Web Server issue):W86-91 (2011)[Pubmed][Full text]
- Medina-Rivera A, Abreu-Goodger C,Thomas-Chollier M, Salgado H, Collado-Vides J, van Helden J.
"Theoretical and empirical quality assessment of transcription factor-binding motifs",
Nucleic Acids Research 39(3):808-24 (2011) [Pubmed] [Full text]
- Thomas-Chollier M, Ledent V, Leyns L, Vervoort M.
"A non-tree-based comprehensive study of metazoan Hox and ParaHox genes prompts new insights into their origin and evolution",
BMC Evol Biol 10:73 (2010)[Pubmed] [Full text] Highly accessed
- Sand O, Thomas-Chollier M, van Helden J.
"Retrieve-ensembl-seq: user-friendly and large-scale retrieval of single or multi-genome sequences from Ensembl",
Bioinformatics 25(20):2739-2740 (2009)[Pubmed][Full text]
- Thomas-Chollier M*, Turatsinze J-V*, Defrance M, van Helden J.
"Using RSAT to scan genome sequences for transcription factor binding sites and cis-regulatory modules",
Nature Protocols 3(10), 1578-88 (2008) [Pubmed]
- Sand O, Thomas-Chollier M, Vervisch E, van Helden J.
"Analyzing multiple datasets by interconnecting RSAT programs via SOAP Web Services - an example with ChIP-chip data",
Nature Protocols 3(10), 1604-15 (2008)[Pubmed]
- Thomas-Chollier M*, Sand O*, Turatsinze J-V, Janky R, Defrance M, Vervisch E, Brohee S, van Helden J
"RSAT: Regulatory Sequence Analysis Tools",
Nucleic Acids Research 36 (Web Server issue):W119-W127 (2008)[Pubmed] [Full text]
- Thomas-Chollier M, Ledent V
"Comparative phylogenomic analyses of teleost fish Hox gene clusters: lessons from the cichlid fish Astatotilapia burtoni: comment",
BMC Genomics, 9:35 (2008) [Pubmed] [Full text]
- Thomas-Chollier M, Leyns L, Ledent V
"HoxPred: automated classification of Hox proteins using combinations of generalised profiles",
BMC Bioinformatics, 8: 247 (2007) [Pubmed] [Full text]
- Simionato E, Ledent V, Richards G, Thomas-Chollier M, Kerner P, Coornaert D, Degnan BM, Vervoort M.
"Origin and diversification of the basic helix-loop-helix gene family in metazoans: insights from comparative genomics.",
BMC Evol Biol, 7:33 (2007) [Pubmed] [Full text] Highly accessed
International and National conferences
- Invited speaker at INSERM workshop 245 Enhancer structure and function April 2017 (Bordeaux, France).
- Invited speaker at 6th Nuclear Receptor Meeting May 2016 (Paris, France).
- Invited speaker at Journées COMATEGE-SeqBio November 2015 (Orsay, France).
- Invited speaker at Bringing maths to Life October 2015 (Naples, Italy).
- Journees Ouvertes de Biologie, Informatique et Mathematiques (JOBIM2015) July 2015 (Clermont-Ferrand, France).
- Sixth Annual RECOMB/ISCB conference on Regulatory and Systems Genomics November 2013 (Toronto, Canada).
- The next NGS challenge May 2013 (Valencia, Spain).
- Invited speaker at INSERM workshop 212 Bioinformatics approaches to decipher genome regulation from high-throughput data October 2011 (Bordeaux, France)
- Invited speaker at 15th international Evolutionary Biology Meeting September 2011 (Marseille, France)
- EED2010 - 3rd Euro Evo Devo Conference July 2010 (Paris, France).
- Classes at Ecole normale supérieure :
- L3 : Ateliers méthodologie
- L3 : Génétique – Génomique – Bioinformatique
- L3 : Introduction aux sciences du vivant (Biologie pour non biologistes)
- M1 : Computational biology project
- M1+M2 : Mathematics and programming training
- M1+M2 : Methodology Tutorials
- M2: Computational analysis of cis-regulatory sequences
- Trainings for researchers:
- ChIP-seq data analysis (UAEM, Cuernavaca, Mexico ; National University of Singapore ; VIB Bits, Leuven, Belgium)
- Initiation au traitement des données de génomique obtenues par séquençage à haut débit (Ecole de bioinformatique AVIESAN IFB, Roscoff)
Thrilling, as in less than a decade (barely since my PhD), we have been propelled into the ”Big Data” era of Biology [Stephens et al., 2015]. Improvements in sequencing technologies have led to an explosion of Genomics data. These billions of Terabytes (”Zettabytes”) of sequence data are raising challenges for computer scientists : data compression and storage, accessibility and distribution, development of more efficient algorithms to process these large datasets. The challenge for bioinformaticians is to keep up with these perpetual new developments to obtain biological insights from all these datasets, bridging the gap between computer scientists and experimental biologists. In just a few years, the global demand in bioinformatics skills has exploded, with several job advertisements posted every single day, solely in France (source: www.sfbi.fr) !
Today, it is obvious that there are not enough bioinformaticians. It has become ordinary to be approached by experimental biologists desperate to find ”someone to analyse their data”. That is when the frustration comes in, as bioinformaticians are too often considered as a mere service provider, contacted once the raw data are already produced to apply routine pipelines, regardless of the fact that most projects require customised analyses [Chang, 2015]. Frustration also comes from the lack of consensual definition of ’bioinformatician’ [Smith, 2015]. Within the spectrum of bioinformaticians, I came to consider myself as a computational (or dry) biologist, motivated by biological questions and using a computer as my bench. In this new Big Data era, collaboration between wet and dry biologists is becoming the new standard. Bioinformaticians should be involved early in the experimental design, and fair co-authorship on the publications should be customary. Evaluation criteria should be adapted for bioinformatician careers [Chang, 2015], acknowledging that working with multiple collaborators on very diverse biological questions is actually a sign of success rather than dispersion. The evaluation criteria need to be broadened to not only include the production of scientific software, but also recognize the maintenance of these software for the community [Singh Chawla, 2016]. Last, the frustration also comes when reading high-impact journal articles that have questionable and often unreproducible bioinformatic data analyses. During the peer-reviewing process, editors should enforce policies to ask reviewers if the manuscript should be sent to a bioinformatics specialist, similar to the policies often in place for statistics.
Training in bioinformatics has become crucial in recent years. On the one hand, by providing courses and training material [Lewitter, 2006] dedicated to researchers, to alleviate the current bottleneck of sequence data analysis. It is also important to provide user-friendly computer tools to experimentalists, who have the biological expertise to analyse their data, but often lack bioinformatics skills. On the other hand, it is necessary to engage the undergraduate biology students into interdisciplinary work and computational biology, so that the next generation of biologists and clinicians will have essential bioinformatics skills [Brazas et al., 2014].
Even if this dissertation focuses on my research work, teaching takes a huge part of my activity and motivation to be associate professor. I am gladly contributing to the above-mentioned teaching aspects by (i) my engagement in the AVIESAN/IFB school of bioinformatics for researchers, as well as in various trainings for biologists (Belgium, France, Singapore), (ii) developing usable bioinformatics tools (mainly RSAT) and training users via published protocols and workshops, (iii) as vice-president of the French Society of Bioinformatics (SFBI), co-organising the first national meet- ing dedicated to the teaching of bioinformatics at the undergraduate level, and (iv) at ENS, teaching computational biology to all biology students, and introduce them to the current challenges of the Big Data era.
ReferencesBrazas, M. D., Lewitter, F., Schneider, M. V., van Gelder, C. W. G., and Palagi, P. M. (2014). A Quick Guide to Genomics and Bioinformatics Training for Clinical and Public Audiences. PLoS computational biology, 10(4):e1003510.
Chang, J. (2015). Core services: Reward bioinformaticians. Nature, 520(7546):151–152.
Lewitter, F. (2006). Welcome to plos computational biology “education”. PLoS computational biology.
Singh Chawla, D. (2016). The unsung heroes of scientific software. Nature, 529(7584):115–116.
Smith, D. R. (2015). Broadening the definition of a bioinformatician. Frontiers in genetics, 6:258.
Stephens, Z. D., Lee, S. Y., Faghri, F., Campbell, R. H., Zhai, C., Efron, M. J., Iyer, R., Schatz, M. C., Sinha, S., and Robinson, G. E. (2015). Big Data: Astronomical or Genomical? PLoS Biology, 13(7):e1002195.
When I started my PhD in bioinformatics, I ordered a Mac laptop as my working machine. When I received it, a senior postdoc in the lab told me "hey, you got a Linux for Barbie !".
Probably a great joke for the guys, but that day, I painfully discovered that being a woman would make a difference. That I would need to prove my skills to be considered. That I was not necessarily welcome in "their circle". After this, I have been extremely lucky with three open-minded mentors, who supported and guided me, lowering this bad feeling that nevertheless grew back as I became "older" in research.
Ten years and many discussions with female (and male !) scientists later, I'm convinced that this feeling is shared by many young women scientists, not just in bioinformatics, within France and abroad. I wish to do something about it, at my own level. As part of the SFBI, I have contributed to awareness of the low number of female invited speakers and chairs at Jobim, and trying to improve this. This page (in construction) aims at sharing some information I gathered for a talk on women in sciences I gave as a postdoc in Berlin. I would like to be more formally involved in the matter of equal opportunities in science, not sure yet in which form.