Astronomer Logo

Astronomer

Senior Customer Reliability Engineer, Infrastructure

Posted 7 Days Ago
Remote
Hiring Remotely in United States
165K-185K Annually
Senior level
Remote
Hiring Remotely in United States
165K-185K Annually
Senior level
The role involves ensuring customer success with Astronomer's managed Airflow service, focusing on cloud infrastructure reliability, troubleshooting issues, and enhancing customer experience.
The summary above was generated by AI

Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 700 of the world's leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.

Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven't followed traditional career paths, and we welcome it if yours hasn't either.

About this role:

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service.

The CRE are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.

As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents either raised by a customer, or from our monitoring system and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.

This role is directly customer-facing and gives exposure to very diverse problems and requirements. The CRE get the opportunity to interface with customers from a variety of industries across different cloud providers, and all with different expectations. Your contributions will directly impact customers' success with using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.

This position includes a requirement to work from 12PM to 6PM PST, Monday to Friday. Your remaining work time is flexible.

What you get to do:

  • Provide solutions to customers to make them successful using our products.

  • Troubleshoot Customer environments and engage in active triaging with customers

  • Provide feedback to the product development teams on customer needs and pain points.

  • Build out our monitoring and alerting systems.

  • Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible. 

  • Help direct the architecture of the products and contribute where possible.

  • Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production.

  • Participate remotely within a fully distributed team.

  • Enhance and Enrich customer documentation

  • Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems.

  • Help maintain 24x7 coverage through a specified 6-hour pager period during your work day.

  • Participate in paid on-call rotation for weekend coverage.

What you bring to the role:

  • 5 years of experience, preferably with large, complex SaaS infrastructures operating at scale

  • Commercial experience using or managing Kubernetes clusters

  • Experience managing a Production  distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure)

  • Strong Network Experience with one of the major Clouds 

  • Strong Linux experience

  • Knowledge of how to operate and monitor issues for distributed systems 

  • Experience with Observability tools

  • Previous experience in handling customers issues (internal and external) 

  • Strong Communication Skills

  • DevOps or CI/CD experience

  • Python scripting

  • Good troubleshooting Skills 

Bonus points if you have:

  • Experience as a Site Reliability Engineer

  • Worked with Kubernetes Custom Resources

  • Depth of knowledge with Azure

  • Airflow/Big Data Orchestration experience

  • IaC experience

The estimated salary for this role ranges from $165,000-185,000, along with an equity component. This range is merely an estimate, and the width of the range reflects willingness to consider candidates with broad prior seniority. Actual compensation may deviate from this range based on skills, experience, and qualifications.

#LI-Remote

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.  Astronomer is a remote-first company.

Top Skills

AWS
Azure
GCP
Kubernetes
Linux
Python

Similar Jobs

6 Hours Ago
Remote
Hybrid
2 Locations
190K-220K
Senior level
190K-220K
Senior level
Cloud • Greentech • Other • Energy
The Director of Customer Support will expand the customer support team, develop account playbooks, optimize support models, and drive customer experience initiatives.
Top Skills: LinuxOrchestration TechnologyStorageVirtualization
7 Hours Ago
Remote
Hybrid
Atlanta, GA, USA
23-45
Junior
23-45
Junior
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The SFS Customer Success Advocate assists Square Sellers with inquiries related to business financing, ensuring accurate data capture and high levels of customer satisfaction through effective communication and collaboration with cross-functional teams.
7 Hours Ago
Remote
Hybrid
3 Locations
Junior
Junior
Artificial Intelligence • Healthtech • Information Technology • Natural Language Processing • Software • Analytics • Generative AI
The Technical Implementation Consultant leads the onboarding of clients and partners, ensuring integration of terminology solutions into healthcare IT systems and addressing technical issues.
Top Skills: APIsC#Cs/HtmlFlat FilesJavaScriptJson/XmlPythonSftpsSQL

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account