PySpark Remote Jobs

Find remote jobs requiring PySpark skills. Apply now and work from anywhere.

PySpark is the Python interface for Apache Spark. It lets you write Python code to process and analyze very large datasets across multiple machines. Typical tasks include cleaning and transforming data, batch and stream processing, and running machine learning workflows.

This skill is useful for remote work because many PySpark jobs run on cloud clusters you can access from anywhere. You can share notebooks and code through Git, schedule and automate pipelines, and collaborate asynchronously with teammates. Employers look for people who can design reliable, reproducible workflows and troubleshoot distributed systems without being onsite.

Industries that commonly use PySpark include:

  • Technology and software platforms
  • Finance and insurance
  • Healthcare and life sciences
  • Retail and e-commerce
  • Advertising and media analytics
  • Telecommunications and utilities

To develop this skill, start with strong Python fundamentals and basic data engineering concepts. Practice on a local Spark setup, then move to cloud-managed clusters to learn deployment and scaling. Build projects that use dataframes, structured streaming, and Spark ML, and share your code on GitHub. Read the official docs, follow tutorials, and participate in community forums to deepen your knowledge.

When applying for remote roles, highlight concrete examples: a pipeline you built, a performance problem you fixed, or a model you trained with Spark. Describe the tools you used to collaborate, test, and monitor jobs so hiring managers can see how you will contribute to a distributed team.

Middle Data Engineer (Azure Databricks)

Constanța, Romania
1 day ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Senior Data Scientist, Innovation Lab

San Diego, CA
5 days ago
Deep Learning
Generative AI
Machine Learning
Experian
Full-Time
Experienced

Data Engineer (Azure Databricks)

Łódź, Poland
1 week ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Senior Databricks Architect

Houston, TX
2 weeks ago
Cloud Platforms (Azure/AWS/GCP)
Databricks
Delta Lake
Cystems Logic Inc
Contractor
Expert

Sr. Software Engineer II - Data Solutions & Measurement

Remote
2 weeks ago
Java
Kafka
PySpark
Cint
Full-Time
Experienced

Director, Fraud Analytics Consulting

Costa Mesa, CA
3 weeks ago
AWS
Fraud Analytics
Machine Learning
Experian
Full-Time
Senior Manager

Senior Data Engineer

Buenos Aires, Argentina
3 weeks ago
AWS (Glue, S3, Athena, Step Functions)
Dbt
PySpark
Blend360
Full-Time
Expert

Sr. Software Engineer II - Data Solutions & Measurement

Czech Republic
3 weeks ago
Apache Spark
Backend Engineering
Kafka
Cint
Full-Time
Expert

Senior Data Engineer (AI & AWS)

São Paulo, Brazil
3 weeks ago
Apache Airflow
AWS
PySpark
Blend360
Part-Time
Expert

Senior Data Scientist - Innovation Lab

San Diego, CA
3 weeks ago
Deep Learning
LLMs / Generative AI
Machine Learning
Experian
Full-Time
Experienced

Data Engineer (Azure Databricks)

Zaragoza, Spain
4 weeks ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Middle Data Engineer - Azure Databricks

Katowice, Poland
4 weeks ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Sr. Databricks Architect

Houston, TX
4 weeks ago
Apache Spark
Databricks
Delta Lake
Cystems Logic Inc
Contractor
Expert

Data Engineer Analyst

Montevideo, Uruguay
4 weeks ago
AWS Glue
CDC/Streaming Ingestion
PySpark
Blend360
Full-Time
Experienced

Senior Data Engineer

Bogotá, Colombia
4 weeks ago
AWS (Glue, S3, Athena, Step Functions)
Data Modeling And CDC
PySpark
Blend360
Full-Time
Expert

Data Engineer Analyst

Santiago, Chile
1 month ago
AWS (Glue, S3, Athena, Step Functions)
CDC / Streaming Ingestion
PySpark
Blend360
Full-Time
Experienced

Data Engineer

Bogotá, Colombia
1 month ago
AWS (Glue, S3, Athena)
ETL & Data Pipeline Architecture
PySpark
Blend360
Full-Time
Experienced

Senior Data Engineer

Bogotá, Colombia
1 month ago
AWS (Glue, S3, Athena, Step Functions)
Data Pipeline Architecture & CDC
PySpark
Blend360
Full-Time
Expert

Senior Data Modeling Analyst

Costa Mesa, CA
1 month ago
AWS
Machine Learning
PySpark
Experian
Full-Time
Expert

Software Engineer - ETL & Automation Testing

Chennai, India
1 month ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Senior Data Scientist - Operations

Kraków, Poland
1 month ago
Data Analysis
Machine Learning
PySpark
InPost
Full-Time
Expert

Software Engineer - ETL & Automation (PySpark, Databricks)

Chennai, India
1 month ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Software Engineer - ETL & Automation (Python, PySpark)

Chennai, India
1 month ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Data Scientist, Advertising Products & Solutions

New York, New York
1 month ago
Data Analysis
PySpark
Python
NBCUniversal
Full-Time
Entry Level
YEAR $95000 - $130000

Solution Architect, Data Science & AI

Greenville, SC
1 month ago
Databricks
Generative AI (RAG)
MLflow
Hitachi Solutions
Full-Time
Expert
YEAR $170000 - $200000

Solution Architect - Palantir Foundry

Seattle, WA
2 months ago
Data Pipeline Development
Identity and Access Management (IAM)
Palantir Foundry
Logic20/20 Inc.
Contractor
Expert
HOUR $142 - $162

Pharma Data Analyst

Lisboa, Portugal
2 months ago
AWS Data Lake
AWS QuickSight
Dimensional Data Modelling
Devoteam
Full-Time
Experienced

Ad Products Intern – Academic Year

New York, NY
2 months ago
PySpark
Python
Snowflake
NBCUniversal
Intern
Entry Level
HOUR $19 - $19

Data Engineer (Azure Databricks)

Madrid, Spain
2 months ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Data Engineer II - AWS

Orlando, FL
2 months ago
Apache Airflow / Amazon MWAA
AWS (S3, Redshift, Glue)
PySpark
Versant
Full-Time
Experienced

Looking for a specific job?