Join one of the world's leading innovators with a rich history of monumental breakthroughs.
Our specialty segment provides high-quality, innovative products for life science applications, enabling countless people around the world to make and deliver life-changing discoveries. This includes everything from glass products used by millions of people every day to integral components in vaccine production and exciting future developments.
We're searching for a Data Engineer to be responsible for the architecture, implementation, and governance of our Life Sciences data lake, which supports the division's centralized analytics platform.
3 Awesome Reasons To Work Here
Work in our exciting, newly formed analytics center of excellence.
Use your expertise in data to advance the division.
One of the best overall employee rewards packages around, including 401(k), retirement, benefits, and bonus.
What You Will Be Doing
Working within the business intelligence and data science team, this role will be a key partner in advancing how the division capitalizes on its data.
A successful candidate will have a track record of designing and developing reliable data ingestion pipelines from multiple process and operational data stores using both on-premises and cloud-based technologies. These pipelines will require automated data validation and data profiling, managed under version control, to ensure the ongoing resiliency and maintainability of the inbound data flows that support both business intelligence and advanced analytics projects.
Day To Day Responsibilities
Design, test, deploy, and maintain production big-data ingestion pipelines using established frameworks, patterns of practice, agile software development, and continuous integration and continuous delivery/deployment (CI/CD) practices, collaborating closely with the advanced analytics platform team
Define data ingestion requirements for structured, unstructured, and semi-structured data, pilot their implementation, and ensure user acceptance
Work with data source teams, domain experts, analysts, and data scientists to define and develop data cleansing and data enrichment processes
Develop and implement data governance processes
What You Need For This Position
Bachelor's degree in Computer Science, Engineering, Math, Finance, or related discipline
5+ years of demonstrated production programming proficiency in at least one modern JVM language such as Java, as well as an interpreted language such as Python
3+ years of experience developing batch, micro-batch, and streaming ingestion pipelines using high-level Apache Spark APIs (PySpark, SparkR, and Spark SQL)
3+ years of production experience using SQL, including DDL
2+ years of DevOps experience with AWS platform services such as S3, EC2, Database Migration Service (DMS), RDS, EMR, Redshift, Lambda, DynamoDB, CloudWatch, and CloudTrail
Expert-level proficiency with both traditional relational and polyglot persistence technologies
Experience with agile software development and continuous integration and continuous deployment methodologies, along with supporting tools such as Git (GitLab), Jira, Terraform, and New Relic
We'd be really thrilled if you have any of the following:
Prior full-stack app development experience (front-end, back-end, microservices)
Familiarity with Oracle, Microsoft SQL Server, SSIS, SSRS data technologies
Experience with established enterprise ETL and integration tools, including Informatica and MuleSoft
Experience with data sources and integration solutions commonly used in manufacturing, such as PI Integrator and Maximo
Familiarity with reporting and analysis tools such as Power BI, Tableau, or SAS JMP
What's In It For You
In addition to a competitive salary of $125K - $165K with additional bonus and the chance to have a major impact with an industry leader, we offer comprehensive career development and a generous benefits program that reflects the company's commitment to supporting your financial, career development, health, and life goals.
We will assist you with relocation if necessary. Immigration support is also available for this role.
Interviews are being scheduled now. So, if you are an interested Data Engineer, please apply today.