Big Data Lead
Company: Hexaware Technologies
Location: New York, NY 10025
Description:
Mandatory Skills: IDQ and Informatica Cloud.

Qualifications:
- Strong understanding of IDQ features and Informatica Cloud, including data profiling, cleansing rules, fuzzy matching, and data quality metrics.
- Ability to understand data structures and relationships within different data sources.
- Expertise in writing SQL queries to extract and manipulate data from various databases.
- Basic understanding of scripting languages like Python could be beneficial for automation tasks.

Responsibilities:
- Designing and developing data quality rules, cleansing routines, and validation checks within Informatica IDQ.
- Implementing and managing IDQ processes on a cloud platform, including setting up connections to cloud data sources, deploying data quality jobs, and monitoring their execution.
- Performing data profiling to identify data quality issues, such as missing values, duplicates, invalid formats, and outliers.
- Utilizing fuzzy matching techniques to identify and merge similar records across different data sources.
- Applying data cleansing rules to correct data inconsistencies, standardize formats, and address data quality issues.
- Contributing to data governance practices by defining and enforcing data quality standards.
- Working with business stakeholders to understand data quality requirements and translate them into technical specifications within the IDQ environment.
