May 19, 2021

Data Engineer III

  • Sema4
  • Stamford, CT

Job Description

Sema4 is a patient-centered health intelligence company founded on the idea that more information, deeper analysis, and increased engagement will improve the diagnosis, treatment, and prevention of disease. Sema4 is dedicated to transforming healthcare by building dynamic models of human health and defining optimal, individualized health trajectories, starting in the areas of reproductive health and oncology. Centrellis™, our innovative health intelligence platform, is enabling us to generate a more complete understanding of disease and wellness and to provide science-driven solutions to the most pressing medical needs. Sema4 believes that patients should be treated as partners, and that data should be shared for the benefit of all. As a Data Engineer within our Enterprise Information and Delivery team, you will play a vital part in driving Sema4’s future data needs and enable Sema4 to use its data for strategic purposes. You will work alongside data management, software development, data science and DevOps teams to build and optimize data management, engineering tools and interfaces to support internal and external data and analytics project and collaborations. Also, you will help in identifying opportunities for effective and efficient data capture, translating governance directives into technical solutions and ensuring they get adopted across the organization. As a Data Engineer - Lead is experienced in BI and data science development and implementation, data architecture, data visualization and communication, ETL layers, and performance tuning. With an emphasis on effective collaboration with key stakeholders, the Sr. Engineer is responsible for the assessment of business requirements, collection and identification of technical specifications, and the subsequent development of technical solutions. RESPONSIBILITIES: Be the guiding force for stakeholders in design and architecture discussions, helping the engineering teams make key technology choices, and staying associated with the use case through its development lifecycle Manage and optimize processes for data intake, validation, mining, and engineering as well as modeling, visualization, and communication Expert data skills, including complex queries, performance tuning, expertise in a variety of approaches (e.g., relational, dimensional, unstructured) Strong skills in design and implementation of logical and physical approaches to managing and analyzing large volumes of data, with knowledge of best practices Develop and maintain data integration solutions (including ETL design and architecture), semantic layer objects, presentation objects, reports, and dashboards for delivery of data for data structuring and sharing projects Build awareness, increase knowledge sharing and drive adoption of modern technologies and architecture patterns, sharing customer and engineering benefits to gain buy-in Operate as a trusted advisor for data transformation and machine learning ecosystem, helping to shape use cases and implementation in an integrated manner Ability to produce high quality documentation of business and system requirements, system design, data architecture, and training materials Participate in research and development efforts (proof of concept, prototype) as a subject matter expert when introducing new technologies Ensure technology solutions are production ready and meet the defined specifications and that the solution can be maintained via production support methodologies and resources Provide ongoing support and maintenance of deployed ETL\ELT and data pipeline solutions QUALIFICATIONS: Bachelor’s Degree or Master’s in Information Technology, Data Analysis, Computer Science or equivalent focus, or equivalent experience in lieu of degree Minimum of 10 years’ experience Strong, clear communication with ability to tell stories with data and analytics Proven track record of successfully delivering large data-centric projects. Write high quality, well tested, well designed and documented code Strong knowledge of databases (relational and non-relational) Experience with various data and technology stacks (SQL, Cloud(AWS), ETL\ELT Tools, Reporting and Data Visualization Tools) Experience with a Database Management System (Sql-Server, Oracle, MySql) Intermediate experience with SQL, Python, R, .NET, C# or another industry standard tool used in data validation and reconciliation Startup experience is a major plus Healthcare industry a plus

Apply Now