PySpark Remote Jobs

Find remote jobs requiring PySpark skills. Apply now and work from anywhere.

PySpark is the Python interface for Apache Spark. It lets you write Python code to process and analyze very large datasets across multiple machines. Typical tasks include cleaning and transforming data, batch and stream processing, and running machine learning workflows.

This skill is useful for remote work because many PySpark jobs run on cloud clusters you can access from anywhere. You can share notebooks and code through Git, schedule and automate pipelines, and collaborate asynchronously with teammates. Employers look for people who can design reliable, reproducible workflows and troubleshoot distributed systems without being onsite.

Industries that commonly use PySpark include:

  • Technology and software platforms
  • Finance and insurance
  • Healthcare and life sciences
  • Retail and e-commerce
  • Advertising and media analytics
  • Telecommunications and utilities

To develop this skill, start with strong Python fundamentals and basic data engineering concepts. Practice on a local Spark setup, then move to cloud-managed clusters to learn deployment and scaling. Build projects that use DataFrames, Structured Streaming, and Spark MLlib, and share your code on GitHub. Read the official Spark documentation, follow tutorials, and participate in community forums to deepen your knowledge.

When applying for remote roles, highlight concrete examples: a pipeline you built, a performance problem you fixed, or a model you trained with Spark. Describe the tools you used to collaborate, test, and monitor jobs so hiring managers can see how you will contribute to a distributed team.

Director, Fraud Analytics Consulting

Costa Mesa, CA
22 hours ago
AWS
Fraud Analytics
Machine Learning
Experian
Full-Time
Senior Manager

Senior Data Engineer

Buenos Aires, Argentina
22 hours ago
AWS (Glue, S3, Athena, Step Functions)
dbt
PySpark
Blend360
Full-Time
Expert

Senior Data Engineer (AI & AWS)

São Paulo, Brazil
4 days ago
Apache Airflow
AWS
PySpark
Blend360
Part-Time
Expert

Senior Data Scientist - Innovation Lab

San Diego, CA
5 days ago
Deep Learning
LLMs / Generative AI
Machine Learning
Experian
Full-Time
Experienced

Data Engineer (Azure Databricks)

Zaragoza, Spain
1 week ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Middle Data Engineer - Azure Databricks

Katowice, Poland
1 week ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Sr. Databricks Architect

Houston, TX
1 week ago
Apache Spark
Databricks
Delta Lake
Cystems Logic Inc
Contractor
Expert

Data Engineer Analyst

Montevideo, Uruguay
1 week ago
AWS Glue
CDC/Streaming Ingestion
PySpark
Blend360
Full-Time
Experienced

Senior Data Engineer

Bogotá, Colombia
1 week ago
AWS (Glue, S3, Athena, Step Functions)
Data Modeling And CDC
PySpark
Blend360
Full-Time
Expert

Data Engineer Analyst

Santiago, Chile
1 week ago
AWS (Glue, S3, Athena, Step Functions)
CDC / Streaming Ingestion
PySpark
Blend360
Full-Time
Experienced

Data Engineer

Bogotá, Colombia
2 weeks ago
AWS (Glue, S3, Athena)
ETL & Data Pipeline Architecture
PySpark
Blend360
Full-Time
Experienced

Senior Data Engineer

Bogotá, Colombia
2 weeks ago
AWS (Glue, S3, Athena, Step Functions)
Data Pipeline Architecture & CDC
PySpark
Blend360
Full-Time
Expert

Senior Data Modeling Analyst

Costa Mesa, CA
2 weeks ago
AWS
Machine Learning
PySpark
Experian
Full-Time
Expert

Software Engineer - ETL & Automation Testing

Chennai, India
2 weeks ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Senior Data Scientist - Operations

Kraków, Poland
2 weeks ago
Data Analysis
Machine Learning
PySpark
InPost
Full-Time
Expert

Software Engineer - ETL & Automation (PySpark, Databricks)

Chennai, India
3 weeks ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Software Engineer - ETL & Automation (Python, PySpark)

Chennai, India
3 weeks ago
Automation Testing
Databricks
PySpark
NielsenIQ
Full-Time
Experienced

Data Scientist, Advertising Products & Solutions

New York, NY
1 month ago
Data Analysis
PySpark
Python
NBCUniversal
Full-Time
Entry Level
$95,000 - $130,000 per year

Solution Architect, Data Science & AI

Greenville, SC
1 month ago
Databricks
Generative AI (RAG)
MLflow
Hitachi Solutions
Full-Time
Expert
$170,000 - $200,000 per year

Solution Architect - Palantir Foundry

Seattle, WA
1 month ago
Data Pipeline Development
Identity and Access Management (IAM)
Palantir Foundry
Logic20/20 Inc.
Contractor
Expert
$142 - $162 per hour

Pharma Data Analyst

Lisboa, Portugal
1 month ago
AWS Data Lake
AWS QuickSight
Dimensional Data Modelling
Devoteam
Full-Time
Experienced

Ad Products Intern – Academic Year

New York, NY
1 month ago
PySpark
Python
Snowflake
NBCUniversal
Intern
Entry Level
$19 per hour

Data Engineer (Azure Databricks)

Madrid, Spain
1 month ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Data Engineer II - AWS

Orlando, FL
1 month ago
Apache Airflow / Amazon MWAA
AWS (S3, Redshift, Glue)
PySpark
Versant
Full-Time
Experienced

Lead Data Scientist

Vadodara, India
1 month ago
Data Cleaning
Econometrics
PySpark
NielsenIQ
Full-Time
Senior Manager

Solution Architect, Palantir Foundry (1099)

Seattle, WA
1 month ago
Data Pipeline Design
Identity and Access Management (IAM)
Palantir Foundry
Logic20/20 Inc.
Contractor
Experienced
$156 - $164 per hour

Senior Data Scientist - Personalisation

Kraków, Poland
2 months ago
Cloud (Databricks/Azure/GCP/AWS/Snowflake)
Data Analysis
Machine Learning
InPost
Full-Time
Experienced

Middle Data Engineer - Azure Databricks

Warsaw, Poland
2 months ago
Azure Data Factory
Azure Databricks
Delta Lake
Miratech
Full-Time
Experienced

Senior Data Scientist, Innovation Lab

San Diego, CA
2 months ago
Deep Learning
Machine Learning
PySpark
Experian
Full-Time
Experienced

Middle Data Analyst

Tiranë, Albania
2 months ago
Business Intelligence (BI)
Exploratory Data Analysis (EDA)
PySpark
Sigma Software
Full-Time
Experienced
