Location
Type, TX | United States
Job description
JOB-10040149
Anticipated Start Date
3/18/2024
Location
Houston, TX
Type of Employment
Contract Hire
Employer Info
This Company is a leading industrial technology company using material science to push boundaries in semiconductor, life sciences, and other technology-enabled sectors. They are a leader in sealing technologies, advanced surface technologies, and highly engineered materials. Their products and services are sold into more than 40 distinct end-markets that touch our lives every day – from food and pharmaceutical facilities to semiconductor clean rooms, from agricultural robots that help grow your food to last-mile technologies that deliver it to your doorstep, from commercial aviation to space exploration, and much more in between.
Job Summary
We are currently seeking an experienced Data Engineer to join the Big Data and Advanced Analytics department. The Data Engineer will work closely with business domain experts to create an Enterprise Data Lakehouse to support data analytic use cases for midstream oil and gas business units. This individual will provide analytical and technical leadership to the team to advance the data engineering practice within the organization.
- Work directly with Business domain experts and Data Scientists to develop high quality, reliable, scalable, machine learning systems
- Design and implement frameworks and tools to streamline the machine learning process
- Automate manual data collection and processing tasks to improve efficiency
- Leverage software architecture and design patterns to develop fault tolerant microservices
- Convert research-based machine learning models into production-ready software
- Implement processes to ensure coding standards, code quality, documentation, and test coverage
Skills Required
- 7+ years of programming experience in Python
- Expertise in developing and maintaining data pipelines
- Experience in testing, packaging, and deploying machine learning models
- Experience in software engineering practices such as Design Principles and Patterns, Unit Testing, Refactoring, CI/CD, and version control
- Expertise in Object-Oriented Design Principals and Functional Programming Principals
- Experience with common Python Data Engineering packages including Pandas, Numpy, Pyarrow, Pytest, Scikit-Learn, and Boto3
- Experience in implementing distributed computing systems
- Experience in designing modular, reusable software components
- Experience in developing API endpoints and microservices
- Knowledgeable of MLOps Principles
- Knowledgeable of ML platform technologies including Apache Airflow, Kubernetes, Dask, Ray, and MLFlow
Education/Training/Certifications
- High School Degree or GED
Job tags
Salary