Equifax is searching for a Senior Data Engineer to join our world-class Entity Resolution (Keying & Linking) team within the Global Data & Analytics CoE. The ideal candidate is a rare hybrid, an engineer with the programming abilities to scrape, combine, and manage data from a variety of sources and a statistician who knows how to derive insights from the information within. This role will combine the skills to create new prototypes with the creativity and thoroughness to ask and answer the deepest questions about the data. Qualified candidates will have a strong academic background in mathematics or statistics, passion for data science and data engineering.
This role requires understanding large, diverse data sources and extensive experience developing data matching and Entity Resolution rules. Linking data about consumers and/or businesses will be a large part of this role.
Equifax has a hybrid work schedule that allows for 2 days of remote work (Monday and Friday), with 3 days onsite (Tuesday, Wednesday, Thursday) every week.
This role will work the required onsite days at our Equifax office in Alpharetta, GA or Reston, VA.
Visa sponsorship or support is not available currently or in the future. No C2C. No vendors.
What you will do
Apply the knowledge of data characteristics and data supply pattern, develop rules and tracking process to support data quality model.
Prepare data for analytical use by building data pipelines to gather data from multiple sources and systems.
Integrate, consolidate, cleanse and structure data for use by our clients in our solutions.
Perform design, creation, and interpretation of large and highly complex datasets.
Stay up-to-date with the latest trends and advancements in GCP and related technologies, actively proposing and evaluating new solutions.
Understand best practices for data management, maintenance, reporting and security and use that knowledge to implement improvements in our solutions.
Implement security best practices in pipelines and infrastructure.
Develop and implement data quality checks and troubleshoot data anomalies.
Provide guidance and mentorship to junior data engineers.
Review data analysis performed by junior data engineers.
What experience you will need
BS degree in a STEM major or equivalent discipline; Master's Degree strongly preferred.
5+ years of experience as a data engineer or related role.
Advanced skills using programming languages such as Python, SQL, and NoSQL and intermediate level experience with scripting languages.
Intermediate level understanding and working experience with Google Cloud Platforms (preferred), AWS, or Azure and overall cloud computing concepts, as well as basic knowledge of other cloud environments.
Experience building and maintaining moderately-complex data pipelines, troubleshooting issues, transforming and entering data into a data pipeline in order for the content to be digested and usable for future projects.
Experience designing and implementing moderately complex data models and experience enabling optimization to improve performance.
Demonstrates advanced Git usage and CI/CD integration skills.
What could set you apart
Knowledge of credit bureau data, and familiarity with the laws like FCRA, GLBA, and GDPR.
Associate / Intermediate Cloud and/or SQL cert from GCP, DataCamp, Coursera, Microsoft, AWS, LeetCode, PluralSight, Coursera, etc.
Experience or advanced degree with focus on Entity Resolution concepts, prototype or hands-on projects related to graph, vector DB, embeddings.
Experience in Data Visualization and storytelling.
Experience collaborating with international teams.
Hands-on code development experience with strong coding ethics.