Data Engineer Job at OSI Engineering, Seattle, WA

  • OSI Engineering
  • Seattle, WA

Job Description

A globally leading technology company is seeking an experienced Data Engineer to support large-scale data operations for machine learning workflows. You will work closely with external data vendors and internal teams to ingest, validate, curate, and organize high-quality datasets, enabling downstream ML model development. This role requires a strong background in Python and experience working with AWS S3-based pipelines. All qualified candidates are welcome to apply!

Job Responsibilities:

• Collaborate with external data collection vendors to track and ingest incoming datasets.

• Design and execute robust data validation and curation pipelines to ensure data quality and consistency.

• Implement logic to bin and categorize data according to project-specific criteria.

• Run pseudo-labeling workflows on newly ingested data using pre-trained ML models.

• Maintain clear status and versioning of datasets throughout their lifecycle.

• Distribute and deliver validated data assets to various internal product and ML teams.

• Maintain logs and reports to ensure traceability and accountability across data operations.

Candidate Requirements:

• 5+ years of industry experience in data engineering, data pipelines, or ML infrastructure.

• Strong proficiency in Python, including data processing and scripting.

• Experience working with AWS S3 for managing and organizing large-scale datasets.

• Familiarity with data quality assurance and curation processes.

• Comfortable operating in Unix/Linux environments and familiar with command-line tools.

• Strong communication and coordination skills, especially when collaborating with external vendors and distributed teams.

• Self-driven, organized, and able to handle multiple data workflows in parallel.

Nice to Have:

• Experience with ML pipelines, especially pseudo-labeling or active learning.

• Familiarity with data versioning tools or frameworks (e.g., DVC, LakeFS).

• Prior experience in managing vendor relationships or annotation workflows.

• Ability to speak multiple languages.

Type: Contract

Duration: 12 months (with the possibility of extension)

Work Location: Seattle, WA (On site)

Pay Rate: $68.00 - $83.00 (DOE)
