Data Scientist (Text mining: understanding impact)
Short info about job
Company: European Molecular Biology Laboratory (EMBL)
Department: Literature Services Team
Salary: Grade 5 or 6 (monthly salary starting at £2,552 or £2,856 after tax).
Hours: Full Time
Contract type: Fixed-Term/Contract
Type / Role: Professional or Managerial
Phone: +44-1294 4927886
Fax: +44-1253 2377685
Detailed information about the job: Data Scientist (Text mining: understanding impact)
Terms and conditions
Contract Duration: 2 years
Job Description
We are seeking to recruit a data scientist with text mining skills to join the Literature Services Team at the European Bioinformatics Institute (EMBL-EBI), located on the Wellcome Trust Genome Campus near Cambridge in the UK. This is a fixed-term post to undertake text and data mining projects that support investigations into the impact of funded research and research data infrastructure.
Europe PMC is the database of life sciences abstracts and full text articles that incorporates both PubMed and PMC content (holding over 30 million abstracts and 4.2 million full text articles), and is supported by 27 funders of life sciences research, for whom we also run a public database of awarded grants. In addition to providing powerful search and retrieval mechanisms for the content, such as section-level searching, we integrate the articles with ORCIDs, supporting data, funding information and other resources that provide relevant information for our users. This represents a large collection of material for data mining, on which to explore the impacts of research funding and the use of research data infrastructure. We are therefore looking for a versatile data scientist capable of developing production-quality text and data mining algorithms to support impact analysis.
Specific job responsibilities include:
- Develop algorithms that mine full text research papers for organisation names and grant IDs for Europe PMC funders
- Extend algorithms for mining database accession numbers, resource names or other means of gathering indicators of use of data resources from the literature
- Iteratively improve solutions with key stakeholders and analyse the results
- Develop interfaces that support easy access to the results of this work
At EMBL-EBI, we help scientists realise the potential of ‘big data’ in biology by enabling them to exploit complex information to make discoveries that benefit mankind. Working for EMBL-EBI gives you an opportunity to apply your skills and energy for the greater good.
Qualifications and Experience
The successful candidate should demonstrate some or most of the following:
- Experience of text-mining as applied to biological data resources in an academic, industrial or publishing setting;
- Technical ability e.g. Perl, Java, R, XML parsing;
- Flexible approach and ability to take on new skills;
- Self-starter, able to manage multiple projects;
- Team player and good communicator.
Benefits
EMBL is an inclusive, equal opportunity employer offering attractive conditions and benefits appropriate to an international research organisation. The remuneration package comprises a competitive salary, a comprehensive pension scheme and health insurance, educational and other family-related benefits where applicable, as well as financial support for relocation and installation.
Application Instructions
To apply, please submit a covering letter and CV, with two referees, through our online system.
Additional Information
Applications are welcome from all nationalities. Visa information will be discussed in more depth with applicants selected for interview.
EMBL-EBI is committed to achieving gender balance and strongly encourages applications from women, who are currently under-represented at all levels. Appointment will be based on merit alone.
This position is limited to the project duration specified.