Full-Time Data Engineer

RStudio is hiring a remote Full-Time Data Engineer. The career level for this opening is Expert, and it accepts USA-based applicants working remotely. Read the complete job description before applying.

This job was posted 4 months ago and is likely no longer active.

RStudio

Job Title

Data Engineer

Posted

Job Type

Full-Time

Career Level

Expert

Locations Accepted

USA

Job Details

Posit is seeking our first Data Engineer as part of our Data Science Center of Excellence team.

In this position, you will be responsible for the infrastructure that stores Posit’s corporate data, for operating the software used for corporate data science, for the third-party integrations that bring data in, and for the ETL/ELT code and tools that process the data. As part of the Data Science Center of Excellence team, you will work closely with the CloudOps, Security, and Enterprise Information Management teams to ensure the integrity and governance of our data, and with other teams throughout the organization to ensure the right data is in the right place so we can make data-driven decisions that help us achieve our goals. You will also work closely with end users of data pipelines across the organization to understand their specific problems and collaborate to plan and develop ETL/ELT pipelines that contribute to solutions.

What you’ll own:

  • the operational excellence, reliability, and security of our data infrastructure and services, and advocating for investments to improve them
  • the infrastructure, operations, and deployment pipelines of our data infrastructure
  • ensuring that new features and functionality are designed and built with operational considerations, scalability, cost effectiveness, and sustainability in mind

What you’ll help with:

  • ensuring appropriate metrics and monitoring are in place to provide actionable alerting with a high signal-to-noise ratio
  • improving our infrastructure as code and continuous integration and deployment pipelines on a consistent basis
  • planning and executing tasks related to the company’s data governance strategy
  • giving feedback to and receiving feedback from other engineers in the form of code reviews and blameless post-mortems
  • collaborating cross-functionally with teams across the organization to ensure data availability and support critical company operations, reporting, and data science projects

What you’ll teach:

  • anti-patterns learned from prior experiences handling operational incidents
  • the tools, tips, and tricks that make your professional life easier

What you’ll learn:

  • our current ETL/ELT toolchain of Fivetran, dbt, and Glue, along with data storage tools including Amazon Redshift and S3
  • metrics and monitoring using Datadog
  • infrastructure as code using Pulumi and Python
  • the Posit products and how their data science customers work

About you:

You have 5+ years of professional experience writing software to manage data pipelines and infrastructure. You are user-focused and driven by our mission to facilitate data science and education for everyone. You share our commitment to building great software by striving for robust design, clean and well-tested code, and delightful user experiences. You excel at breaking down complex problems into bite-size tasks and driving them to completion. You are able to act as a champion of data engineering and lead related internal efforts. You have strong proficiency in Python, SQL, and database management (both relational and non-relational), and the ability to become an expert in our current toolchain, including dbt and Amazon S3. You are familiar with a broad set of data engineering tools, including Parquet and Iceberg, and are comfortable recommending new tools when they are the best solution for business problems. You love to learn and help others succeed through code reviews and other forms of mentorship. You are humble, pragmatic, deliberate, and you have a keen sense of empathy for your co-workers and users.

Within 1 month you will:

  • get to know the larger Data Science COE and CloudOps teams and how we create, deliver, and maintain software
  • build and prioritize a backlog of work

Within 3 months you will:

  • begin replacing existing business-critical data pipelines using our ETL/ELT toolchain
  • establish a process for taking ownership of existing data pipelines so that they are actively maintained and monitored for data consistency and accuracy

Within 6 months you will:

  • own or demonstrate expertise in multiple areas of our data infrastructure and pipelines
  • research problems and new technologies and effectively communicate findings to the team
  • identify underutilized or challenging data sources and develop enhanced data pipelines to unlock their potential for driving improved business outcomes
  • propose significant projects and lead them


FAQs

What is the last date for applying to the job?

The deadline to apply for Full-Time Data Engineer at RStudio is 16th of September 2024. We consider jobs older than one month to have expired.

Which countries are accepted for this remote job?

This job accepts applicants from the USA.
