Full-Time Principal Data Engineer
Seamless.AI is hiring a remote Full-Time Principal Data Engineer. This is an Experienced-level position open to remote applicants based in the USA. Read the complete job description before applying.
Seamless.AI
Job Details
The Opportunity: At Seamless.AI, we’re seeking a highly skilled and experienced Principal Data Engineer with expertise in Python, Spark, AWS Glue, and other ETL (Extract, Transform, Load) technologies.
Responsibilities:
- Design, develop, and maintain robust and scalable ETL pipelines to acquire, transform, and load data.
- Collaborate with cross-functional teams to understand data requirements.
- Implement data transformation logic using Python.
- Utilize AWS Glue to create and manage ETL jobs.
- Optimize and tune ETL processes for large data sets.
- Apply methodologies for data matching, deduplication, and aggregation.
- Implement data governance practices.
- Collaborate with the data engineering team to explore new technologies.
Skillset:
- Strong proficiency in Python and related libraries (e.g., pandas, NumPy, PySpark).
- Hands-on experience with AWS Glue.
- Solid understanding of data modeling, data warehousing, and data architecture.
- Expertise in working with large data sets and distributed computing.
- Experience developing and training machine learning models.
- Strong proficiency in SQL.
- Familiarity with data matching, deduplication, and aggregation.
- Experience with data governance, data security, and privacy practices.
- Strong problem-solving and analytical skills.
- Excellent communication and collaboration skills.
Education and Requirements:
- Bachelor's degree in Computer Science or related field.
- 7+ years of experience as a Data Engineer.
- Professional experience with Spark and AWS pipeline development.
Company Information:
Seamless.AI has been delivering sales leads since 2015.
Seamless.AI is an equal opportunity employer.