Senior Data Engineer

Company: Alembic

Location: San Francisco, CA 94112

Description:

About Alembic

Alembic is a fast-growing Series A software startup focused on building cutting-edge solutions that transform how businesses harness data. We are a team of innovators, engineers, and product leaders passionate about solving complex problems with scalable, data-driven technology. At Alembic, we believe that great software is built by great people, and we are looking for a Senior Data Engineer who thrives in a fast-paced, high-impact environment.

About the Role

As a Senior Data Engineer at Alembic, you will be at the core of our data platform, building scalable and reliable data pipelines, optimizing storage solutions, and enabling real-time and batch analytics. You will work closely with data scientists, software engineers, and product leaders to design and implement robust data architectures.

Key Responsibilities
  • Design, develop, and maintain scalable ETL pipelines that ingest, process, and transform large volumes of structured and unstructured data.
  • Optimize data storage solutions using modern data lakehouse architectures and best practices for cost, performance, and reliability.
  • Collaborate with data scientists and engineers to integrate machine learning models and analytical workloads into production environments.
  • Ensure data integrity, quality, and security by implementing monitoring, alerting, and governance best practices.
  • Work with cloud-based data warehouses and distributed data processing frameworks.
  • Continuously evaluate and implement new technologies to improve data infrastructure and operational efficiency.


What We're Looking For
  • 10+ years of experience in data engineering, software engineering, or a related field.
  • Strong expertise in SQL and Python for data processing.
  • Experience with modern data warehousing and lakehouse solutions (e.g., Iceberg or similar).
  • Proficiency in working with distributed systems and big data technologies (Apache Spark, Hadoop, Kafka, Flink).
  • Hands-on experience with cloud platforms (AWS, GCP, Azure) and related data services.
  • Deep understanding of data modeling, database design, and performance optimization.
  • Familiarity with CI/CD pipelines, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, CloudFormation) for data pipelines.
  • Strong problem-solving skills, with a passion for building reliable, scalable, and maintainable data systems.
  • Excellent communication skills and the ability to collaborate in a cross-functional team.


Nice to Have
  • Experience with Graph Databases, NoSQL, or Time-Series Databases.
  • Familiarity with data privacy, governance, and compliance (GDPR, HIPAA, SOC 2).
  • Experience with machine learning pipelines and MLOps.


Why you might be excited about Alembic:
  • You want to build something that is both technologically challenging and solves a real customer need. You want a role with major upside that tackles a massive market opportunity.
  • You are a serial startup builder or want to learn more before becoming a founder yourself. Our team holds deep experience building and selling B2B marketing solutions that work.
  • You want to work where you can take a big swing at building something big while maximizing your personal growth.


Why you might not be excited:
  • You only want to tell people what to do instead of building and coding alongside them, and on your own. If so, we're not the environment for you.
  • You prefer a company with a fully built-out process for every little detail.
  • You prefer static over dynamic. Projects, priorities, and roles will adapt to your skill set and your goals. Though we have a playbook for growth, we proudly remain an early-stage startup.
