Junior Data Engineer - scikit-learn

Apply Now

Company: Tech Tammina

Location: New York, NY 10025

Description:

Job Title: Data Engineer

Job Location: New York, NY (Hybrid)

Job Type: Full Time
Expected Start Date: ASAP


Job Description:

Must be strong with Python for ML pipelines specifically with Py torch and scikit-learn

AWS is required, building pipelines within

Should have a background in LLM (langchain, agents, extensive prompt engineering)

The "strong additional requirements" below are required.

Responsibilities:
  • Ingesting, structuring and analyzing a wide range of unstructured datasources
  • Designing, maintaining and orchestrating data pipelines in an AWS environment for production processing and training flows
  • Continuously evaluate, analyze, test and improve the quality, privacy and performance of our data systems
  • Contribute across the product, where - from front-end UX and product design, API/systems architecture and ML processing/training


Minimum Qualifications:
  • 3+ years of experience ingesting, analyzing and structuring a wide variety of datasources
  • Significant experience building and maintaining data pipelines in a production environment
  • Strong database/SQL, python, pandas (or equivalent) experience
  • Prior experience working in fast paced environments and tackling problems across the stack with quick iterations while maintaining a high quality bar.


Strong Additional Qualifications:
  • Significant healthcare data experience
  • LLM experience (langchain, agents, extensive prompt engineering)
  • MLE Experience - pytorch, scikit-learn, etc..
  • Extensive production AWS, container and/or data orchestration experience
  • Full stack development experience (JS/TS/Node in particular)

Similar Jobs