A globally leading technology company is seeking an experienced Data Engineer to support large-scale data operations for machine learning workflows. You will work closely with external data vendors and internal teams to ingest, validate, curate, and organize high-quality datasets, enabling downstream ML model development. This role requires a strong background in Python and experience working with AWS S3-based pipelines. All qualified candidates are welcome to apply!
Job Responsibilities:
• Collaborate with external data collection vendors to track and ingest incoming datasets.
• Design and execute robust data validation and curation pipelines to ensure data quality and consistency.
• Implement logic to bin and categorize data according to project-specific criteria.
• Run pseudo-labeling workflows on newly ingested data using pre-trained ML models.
• Maintain clear status and versioning of datasets throughout their lifecycle.
• Distribute and deliver validated data assets to various internal product and ML teams.
• Maintain logs and reports to ensure traceability and accountability across data operations.
Candidate Requirements:
• 5+ years of industry experience in data engineering, data pipelines, or ML infrastructure.
• Strong proficiency in Python, including data processing and scripting.
• Experience working with AWS S3 for managing and organizing large-scale datasets.
• Familiarity with data quality assurance and curation processes.
• Comfortable operating in Unix/Linux environments, with familiarity in using command-line tools.
• Strong communication and coordination skills, especially when collaborating with external vendors and distributed teams.
• Self-driven, organized, and able to handle multiple data workflows in parallel.
Nice to Have:
• Experience with ML pipelines, especially pseudo-labeling or active learning.
• Familiarity with data versioning tools or frameworks (e.g., DVC, LakeFS).
• Prior experience in managing vendor relationships or annotation workflows.
• Speak multiple languages
Type: Contract
Duration: 12 months (with a possibility to extend)
Work Location: Seattle, WA (On site)
Pay Rate: $ 68.00 - $ 83.00 (DOE)
...Schneider Electric is looking to hire a IT Integration Intern for Spring 2026 in our Raleigh, NC location. Schneider Electric... ...our commitment to ethics, safety, sustainability, quality and cybersecurity, underpinning every aspect of our business and our willingness...
Job Description Job Description Project Manager - Civil Construction Location : Brownsville, TX Build Legacy. Lead with Integrity. Grow with Purpose. Persons Services is a dynamic and rapidly expanding construction firm, proudly operating across the United...
...Description UW Health is seeking a Nurse Practitioner Interventional Radiology for a job in MADISON, Wisconsin. Job Description &... ...to patients. \n We are seeking a Nurse Practitioner/Physician Assistant to: \n \n Practice in a dynamic and challenging subspecialty...
...Trucking is looking to add drivers to our Lease Operator division. Our program allows... ...and choose your loads. It is a no credit check / no money down lease program. If youre interested... ...If youre looking for a lease purchase that you can earn great money with and...
...Salesforce QA Tester Location: San Mateo, CA Duration: 18-24 Months Job Description: Bachelor's degree in Computer Science or related field of study. 6+ years of Quality Assurance experience with 4 years of exclusive SFDC QA experience. 3+ years' experience...