Data Engineer
Company: United Software Group, Inc.
Location: Milford, CT 06460
Description:
SQL, Python, and PySpark are required.
Responsibilities
Contribute to the design and growth of our Data Products and Data Warehouses around Engagement and Retention Analytics
Work with the team to design and develop scalable data warehousing solutions, building ETL pipelines in Big Data environments (cloud, on-prem, hybrid)
Our tech stack includes Hadoop, AWS, Snowflake, Spark, and Airflow; languages include Python and Scala
Actively participate in and advocate for agile/scrum practices to support team health and process improvement
Basic Qualifications
1+ years of data engineering experience developing large data pipelines
Strong Python programming skills
Strong SQL skills and ability to create queries to extract data and build performant datasets
Hands-on experience querying and processing data with distributed systems such as Spark and Hadoop (HDFS, Hive, Presto, PySpark)
Preferred Qualifications
Experience with at least one major MPP or cloud database technology (Snowflake, Redshift, BigQuery)
You are a problem solver with strong attention to detail and excellent analytical and communication skills
Required Education
Bachelor's or master's degree in Computer Science, Information Systems, or a related field