Postdoc Position in Machine learning tools to combine sequence and biologically heterogeneous data

Centre / Institution:
Centre for Research in Agricultural Genomics
Bioinformatics expertise:
Population Genetics

Job description

The Center for Research in Agricultural Genomics (CRAG) is looking for a Postdoc Researcher to be part of the Statistical and Population Genomics Group. For more information:

CRAG is an independent research institution engaged in leading-edge basic and applied plant and farm animal sciences. CRAG is established as a Consortium of the Spanish National Research Council (CSIC), Institute of Agrifood Research and Technology (IRTA), Autonomous University of Barcelona (UAB), and University of Barcelona (UB). The Center is located at the UAB campus, and currently hosts 200 members from across the world.

Research Programs at CRAG (from basic science to applied research using plant experimental model systems, crops and farm animals) make extensive use of large sets of genetic and genomic data, including high-throughput sequencing-based methods, high-throughput genotyping, and complete genome sequences and genome re-sequencing (


Group description:

The numerical genomics group at Centre for Research in Agricultural Genomics (CRAG) is primarily interested in the use of high throughput sequencing technology (NGS) for population and statistical genomics. Topics of particular interest is the use of sequence for genomic selection. CRAG was recently awarded a Severo Ochoa project for excellence centers in Spain. This project will be carried out in cooperation with international and national groups with a long experience in machine learning that complement our expertise.



We are looking for a Postdoc Researcher with a computational profile and an interest in biological applications.

Big data are characterized not only by their size but also by their heterogeneity and noisiness. These features apply in particular to genomic data: their size has been increasing exponentially with the advent of new sequencing technologies, but also their complexity. We aim at combining several available sources of information, not only the phenotypes and sequence data, but also, e.g., annotation features or metabolic pathways. An important goal is not only to provide efficient predictors but also tools to biologically interpret the results. We aim at developing tools that combine image and sequencing data.

The tasks are included within the framework at the project AGL2016-78709-R.

Laboratory: Dr. Miguel Pérez Enciso (

Desired skills and expertise


  • Educational requirements: PhD in Biology, Biochemistry or similar.
  • Qualifications and Experience: Programming ability and experience in python. Experience in machine learning area.

Valued Skills:

  • Strong skills in communication, organization, and interpersonal relations.
  • Experience in image analysis.

Contract duration and other benefits

We offer:

Contract duration: 12 months (extendable if funding available).
Hours/week: Full time

Required information and contact

Submission of applications:

Interested candidates, please submit:

  • Letter of motivation describing past and current projects (1.5 pages max.)
  • Full CV including a list of publications, experience and techniques.
  • Three references (including email address and phone number)

All the candidatures must be registered and send the documents at:  

Deadline: Until suitable candidate found