Staff Site Reliability Engineer - Data Engineering, Platform
Company: Ellation, Inc.
Location: San Francisco
Posted on: November 13, 2024
Job Description:
Who We Are
We're a cast of characters working to shine a spotlight on anime.
Crunchyroll is an international business focused on creating both online and
offline experiences for fans through content (licensed,
co-produced, originals, distribution), merchandise, events, gaming,
news, and more. Visit our pages for more information about our
collection of brands.
About the Team
The Site Reliability Engineering (SRE) team is dedicated to
ensuring the reliability, scalability, and performance of our data
infrastructure. We focus on standardizing and implementing
monitoring and alerting across all datastores to track key metrics
like errors, latency, and throughput, and to ensure critical
systems are covered. Our team also leads efforts to keep databases
up-to-date, implements Infrastructure as Code (IaC) for high
availability and performance, and automates key processes to
enhance operational efficiency.
We lead and evangelize the principle of 100% automation.
Additionally, we define and document operational requirements,
develop incident response processes, and automate monitoring and
compliance checks to maintain a secure and reliable data
environment. By continuously improving load testing and optimizing
data governance practices, we support the overall health and
efficiency of our data systems.
About the Role
Crunchyroll is growing and changing, presenting unique challenges
and opportunities to support millions of anime fans around the
world. The Data Engineering team provides seamless help to our
internal stakeholders, ensuring an exceptional experience for all
Crunchyroll fans.
As a Staff Site Reliability Engineer for the Data Engineering team,
you will be responsible for maintaining and enhancing the
reliability of our data infrastructure. Your work will directly
impact the availability and performance of our data services,
enabling the organization to make better decisions. You will collaborate
closely with data engineers and software engineers to develop and
drive 100% automation and best practices for deep monitoring and
alerting. This role will report to our Director of Data
Engineering. While it is preferred for this role to sit in one of
our offices, fully remote is also an option in the United
States.
About You
Why you will love working at Crunchyroll
Not only will you get to work with fun, passionate and inspired
colleagues, you will also...