PySpark Remote Jobs

Find remote jobs requiring PySpark skills. Apply now and work from anywhere.

PySpark is the Python interface for Apache Spark. It lets you write Python code to process and analyze very large datasets across multiple machines. Typical tasks include cleaning and transforming data, batch and stream processing, and running machine learning workflows.

This skill is useful for remote work because many PySpark jobs run on cloud clusters you can access from anywhere. You can share notebooks and code through Git, schedule and automate pipelines, and collaborate asynchronously with teammates. Employers look for people who can design reliable, reproducible workflows and troubleshoot distributed systems without being onsite.

Industries that commonly use PySpark include:

Technology and software platforms
Finance and insurance
Healthcare and life sciences
Retail and e-commerce
Advertising and media analytics
Telecommunications and utilities

To develop this skill, start with strong Python fundamentals and basic data engineering concepts. Practice on a local Spark setup, then move to cloud-managed clusters to learn deployment and scaling. Build projects that use dataframes, structured streaming, and Spark ML, and share your code on GitHub. Read the official docs, follow tutorials, and participate in community forums to deepen your knowledge.

When applying for remote roles, highlight concrete examples: a pipeline you built, a performance problem you fixed, or a model you trained with Spark. Describe the tools you used to collaborate, test, and monitor jobs so hiring managers can see how you will contribute to a distributed team.

Data Engineer

Mexico

3 days ago

AWS

data modeling

Data Pipelines

Blend360

Full-Time

Experienced

Databricks Practice Lead

Guadalajara, Mexico

5 days ago

Azure

Data Engineering

Databricks

KMS Technology

Full-Time

Manager

Senior Data Engineer

Málaga, Spain

1 week ago

Databricks

Git

PySpark

Talan

Temporary

Experienced

Cientista de Dados II

São Paulo, Brazil

1 week ago

Data Manipulation

Machine Learning

PySpark

Experian

Full-Time

Experienced

Cientista de Dados III

São Paulo, Brazil

1 week ago

Feature Learning

Machine Learning

PySpark

Experian

Full-Time

Experienced

Senior Data Scientist, Innovation Lab

San Diego, CA

2 weeks ago

Deep Learning

Generative AI

Machine Learning

Experian

Full-Time

Experienced

Data Modeling Expert, Fraud Analytics Consulting

United States

3 weeks ago

AWS

data modeling

Machine Learning

Experian

Full-Time

Expert

Senior Data Analytics Consultant

Kraków, Poland

3 weeks ago

data modeling

ETL/ELT

PySpark

InPost

Contractor

Experienced

Middle Data Engineer (Azure Databricks)

Constanța, Romania

1 month ago

Azure Data Factory

Azure Databricks

Delta Lake

Miratech

Full-Time

Experienced

Senior Data Scientist, Innovation Lab

San Diego, CA

1 month ago

Deep Learning

Generative AI

Machine Learning

Experian

Full-Time

Experienced

Data Engineer (Azure Databricks)

Łódź, Poland

1 month ago

Azure Data Factory

Azure Databricks

Delta Lake

Miratech

Full-Time

Experienced

Senior Databricks Architect

Houston, TX

1 month ago

Cloud Platforms (Azure/AWS/GCP)

Databricks

Delta Lake

Cystems Logic Inc

Contractor

Expert

Sr. Software Engineer II - Data Solutions & Measurement

Remote

1 month ago

Java

Kafka

PySpark

Cint

Full-Time

Experienced

Director, Fraud Analytics Consulting

Costa Mesa, CA

2 months ago

AWS

Fraud Analytics

Machine Learning

Experian

Full-Time

Senior Manager

Senior Data Engineer

Buenos Aires, Argentina

2 months ago

AWS (Glue, S3, Athena, Step Functions)

Dbt

PySpark

Blend360

Full-Time

Expert

Sr. Software Engineer II - Data Solutions & Measurement

Czech Republic

2 months ago

Apache Spark

Backend Engineering

Kafka

Cint

Full-Time

Expert

Senior Data Engineer (AI & AWS)

São Paulo, Brazil

2 months ago

Apache Airflow

AWS

PySpark

Blend360

Part-Time

Expert

Senior Data Scientist - Innovation Lab

San Diego, CA

2 months ago

Deep Learning

LLMs / Generative AI

Machine Learning

Experian

Full-Time

Experienced

Data Engineer (Azure Databricks)

Zaragoza, Spain

2 months ago

Azure Data Factory

Azure Databricks

Delta Lake

Miratech

Full-Time

Experienced

Middle Data Engineer - Azure Databricks

Katowice, Poland

2 months ago

Azure Data Factory

Azure Databricks

Delta Lake

Miratech

Full-Time

Experienced

Sr. Databricks Architect

Houston, TX

2 months ago

Apache Spark

Databricks

Delta Lake

Cystems Logic Inc

Contractor

Expert

Data Engineer Analyst

Montevideo, Uruguay

2 months ago

AWS Glue

CDC/Streaming Ingestion

PySpark

Blend360

Full-Time

Experienced

Senior Data Engineer

Bogotá, Colombia

2 months ago

AWS (Glue, S3, Athena, Step Functions)

Data Modeling And CDC

PySpark

Blend360

Full-Time

Expert

Data Engineer Analyst

Santiago, Chile

2 months ago

AWS (Glue, S3, Athena, Step Functions)

CDC / Streaming Ingestion

PySpark

Blend360

Full-Time

Experienced

Data Engineer

Bogotá, Colombia

2 months ago

AWS (Glue, S3, Athena)

ETL & Data Pipeline Architecture

PySpark

Blend360

Full-Time

Experienced

Senior Data Engineer

Bogotá, Colombia

2 months ago

AWS (Glue, S3, Athena, Step Functions)

Data Pipeline Architecture & CDC

PySpark

Blend360

Full-Time

Expert

Senior Data Modeling Analyst

Costa Mesa, CA

2 months ago

AWS

Machine Learning

PySpark

Experian

Full-Time

Expert

Software Engineer - ETL & Automation Testing

Chennai, India

2 months ago

Automation Testing

Databricks

PySpark

NielsenIQ

Full-Time

Experienced

Senior Data Scientist - Operations

Kraków, Poland

2 months ago

Data Analysis

Machine Learning

PySpark

InPost

Full-Time

Expert

Software Engineer - ETL & Automation (PySpark, Databricks)

Chennai, India

2 months ago

Automation Testing

Databricks

PySpark

NielsenIQ

Full-Time

Experienced