• Principal Data Scientist

    Job Location(s): US-IN-Indianapolis | US-Nationwide
    Job Category: Information Technology
  • Job Overview

    Informatics at Covance is a high-profile, high-impact team that focuses on creating innovative data-driven solutions to improve the speed, cost and quality of drug development. As a market leader in central laboratory and pre-clinical services and a top-5 provider of phase III clinical trial management services, Covance has assembled the most comprehensive investigator and clinical lab database in the pharmaceutical industry, spanning >11,000 protocols, >600 indications, >175,000 unique investigators, and >14 million patient visits over the past 10 years. The role of the Informatics team is to lead breakthrough innovations that will unlock the power of this data to help pharmaceutical and biotechnology companies bring the miracles of medicine to people in need faster and more efficiently.

    As a key member of the Informatics team, you will be responsible for managing and executing important R&D projects, while providing thought leadership along with significant personal contributions. As a recognized domain expert, you will develop analytic approaches, tools and methodologies that use operational, clinical, health outcomes and social networking data to address questions such as whether it is feasible to recruit patients to the current study design, how to optimize the selection of clinical sites and investigators based on historical performance data, how to predict patient enrollment based on epidemiology and disease prevalence in specific geographies, how to optimize study design to increase the probability of success, how to assess operational and safety risks, and how to optimize the utilization of resources to ensure patient safety, optimal trial execution, and maximum operational efficiency. In addition to data analysis responsibilities, you will also contribute to creating state-of-the-art software infrastructure that will allow for the efficient capture, processing and dissemination of data for wider use within the company and by our external clients. Working in a highly collaborative environment, you will generate key insights that influence business decisions, drive product innovation, and partner with engineering teams to build and launch data-enabled capabilities and product offerings. You will be active in the data sciences community and contribute to attracting, retaining and growing the best talent in a performance-driven organization.

    We seek smart, focused, passionate self-starters who bring energy, new ideas and practical experience to a fast-paced and dynamic team, are obsessed with delivering useful and elegant solutions, and care deeply about their customers and each other.


    • Discover stories told by data and present them to others through rich and intuitive visualizations.
    • Develop novel ways of integrating, mining and visualizing diverse, high-dimensional and poorly curated data sets.
    • Formulate, implement, test, and validate predictive models, and implement efficient automated processes for producing modeling results at scale.
    • Write production-quality code while implementing your own ideas. Work closely with engineering teams and participate in the full development cycle from product inception, research, and prototyping to release in production.
    • Measurably impact company performance by delivering high-quality, scalable data products.
    • Contribute to bid defenses and present capabilities to internal and external clients.
    • Maintain external visibility through publications and conference presentations.
    • Demonstrate very strong technical and thought leadership and the ability to influence and guide the work of others.


    • PhD in computational, physical or life sciences with a strong quantitative focus, or BS/MS with equivalent experience.



    • 8+ years of post-BS, hands-on experience in the analysis, modeling, and visualization of large, complex, and heterogeneous data sets (may include academic training).
    • Deep understanding and hands-on experience with statistics, data mining, machine learning and optimization techniques.
    • Excellent understanding of algorithms (both statistical and otherwise) and their scalability.
    • Expert programming skills in C#, C++, R, Python, Matlab, SAS, or equivalent.
    • Proficiency with relational databases and SQL.
    • Excellent interpersonal and communication skills, with strong written and verbal presentation.




    • Experience with NoSQL databases and other big data technologies.
    • Experience with web development.
    • Experience with life science and/or healthcare data.

