Lead Data Engineer Job at WorkHQ, Los Angeles, CA

Yko2OVZVVDFXZnF3WUdVbmh6aS90eUkzMXc9PQ==
  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a US-only, Remote role (Mainland).

Role Overview

Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.

Hire data engineers to aid you in that journey.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (Open to shifting to other solutions like Data Bricks / Snowflake)

  • Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years professional data engineering experience

  • Good proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python, SQL skills

Nice to Have

  • Probabilistic record linking expertise

  • OpenSearch/elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements - if you lack a few technical skills, but motivated to learn and lead the platform, please apply for consideration.

If you are coming from Director/Head of/VP levels that is relevant to this job, you can apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Remote work, Shift work,

Similar Jobs

Garney Construction

Safety Manager Job at Garney Construction

 ...GARNEY CONSTRUCTION As Safety Manager position in Nashville, TN, at Garney Construction, you will be responsible for ensuring safety is the number one priority on our water and sewer pipeline projects. WHAT YOU WILL BE DOING Review, implement, and assist... 

Compass Pennsylvania, LLC

Real Estate Operations Coordinator Job at Compass Pennsylvania, LLC

 ...organized, creative, and eager to grow in real estate?Compass Pennsylvania, LLC is seeking a Real Estate Operations Coordinator to join our team in Haverford, PA. This hybrid...  ...and proactive back-office support for all transactions Coordinate and prepare listings for the... 

HCA Healthcare

Wound Care RN Job at HCA Healthcare

 ...other healthcare provider. We are seeking a Part-Time Out patient Wound Care Registered Nurse to join our healthcare family.Benefits...  ...Apply today for our NICU Nurse opportunity.Out Patient Wound Care RN - Part-TimeUnder the supervision of the Wound Center Manager,... 

Tata Consultancy Services

Robotics Engineer Job at Tata Consultancy Services

 ...experience in auto-labeling pipelines and dataset generation for AI models Publications, patents, or projects in simulation-based robotics development are a plus Develop and maintain simulation environments using NVIDIA Isaac Sim and Omniverse Integrate CAD... 

Shinebask Technologies LLC

Millwright rigging estimator Job at Shinebask Technologies LLC

 ...Full Time/Permanent Position_ Millwright & Rigging Estimator Location_ Buffalo, NY, United States Please share your resume...  ...accredited institution (or equivalent education and/or field experience). Heavy industrial steel mill experience is preferred. ...