web statistics
Avatar

Céline Mercier

Bioinformatician

About

I’m a bioinformatics scientist and engineer specialized in biological sequence analysis.

Interests

  • Biological sequence analysis e.g. sequence alignment, clustering, taxonomic assignment
  • Methodological developments for environmental DNA data analysis
  • High-performance computing
  • Efficient data management

Education

  • PhD in Bioinformatics and Ecology, 2015

    Université Grenoble Alpes (France) and Laboratoire d’Ecologie Alpine (LECA, France)

  • MSc in Biodiversity-Ecology-Environment, 2011

    Université Grenoble Alpes (France) and University of Oulu (Finland)

  • BSc in Biology and Bioinformatics, 2009

    Université Grenoble Alpes (France)

Experience

 
 
 
 
 

Bioinformatician

Scion (New Zealand Forest Research Institute Limited)

Mar 2021 – Present Rotorua, New Zealand
 
 
 
 
 

Bioinformatics Research Engineer

Laboratoire d’Ecologie Alpine (LECA)

May 2015 – Mar 2019 Grenoble, France
Main developer of the OBITools3, a software package dedicated to DNA metabarcoding analysis.
 
 
 
 
 

PhD in Bioinformatics and Ecology

Laboratoire d’Ecologie Alpine (LECA)

Oct 2011 – Mar 2015 Grenoble, France

Methodological developments for the analysis of DNA metabarcoding data.

Main developer of Sumatra (sequence alignment program) and Sumaclust (sequence clustering program).

Projects

The OBITools3

A full analysis pipeline can be built using the OBITools3 (demultiplexing, paired-end reads alignment, dereplicating, global alignment, PCR error detection, taxonomic assignment, annotation using taxonomic information if necessary, in silico PCR…).

This new version of the OBITools looks to significantly improve the storage efficiency and the data processing speed by relying on an ad hoc database system.

Languages: C / Python / Cython

https://metabarcoding.org/obitools3

Sumatra

Sumatra computes the exact pairwise global alignment scores from one dataset or between two datasets.

Used: lossless kmer filter / optimized exact alignment / SIMD parallelization / pthreads parallelization
Languages: C

https://metabarcoding.org/sumatra

Sumaclust

Sumaclust clusters sequences using the same alignment methods as Sumatra to compute the distance between sequences. The ‘star clustering’ algorithm used is suitable for the quick clustering of a huge number of sequences, and for the detection of ‘erroneous’ sequences created during amplification and sequencing protocols, deriving from ‘true’ sequences.

Used: greedy star-clustering algorithm / lossless kmer filter / optimized exact alignment / SIMD parallelization / OpenMP parallelization
Languages: C

https://metabarcoding.org/sumaclust

Publications

An R package for the evaluation and improvement of DNA metabarcoding data quality

Ecological specialization and niche overlap of subterranean rodents inferred from DNA metabarcoding diet analysis

obitools: a unix-inspired software package for DNA metabarcoding

Open-Source Sequence Clustering Methods Improve the State Of the Art

DNA metabarcoding diet analysis for species with parapatric vs sympatric distribution: a case study on subterranean rodents