Site Reliability Engineer L4/L5 – Live Cloud Platform SRE
Job Description:
- Drive continual improvement in observability, monitoring, and scalability with the primary goal to solve the thundering herd problem with cloud traffic (API gateway, IPC between microservices) for live streaming.
- Implement, automate, execute, and analyze the results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing.
- Write and review code, develop documentation and capacity plans, and debug the hardest problems on some of the largest and most complex systems in the world.
- Coordination, collaboration, and partnership across multiple stakeholders for the smooth execution of live-streaming events.
- Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule.
Requirements:
- 5+ years of service reliability/operational experience running large-scale, high-performance systems & internet services with a focus on traffic at scale.
- Knowledge of and proven experience with L4 Load Balancer, HTTP cache, and reverse proxy technologies.
- Expert-level knowledge of Unix or Linux systems and TCP/IP network fundamentals.
- Proficient understanding of networking principles, transport, and application protocols, especially DNS, TLS, and HTTP(s) etc.
- Proficient in a programming language such as Go, Python, Rust etc.
- Experience with using real-time and Big Data analytic processing technologies (Kafka, time series database and Presto/Trino, Spark SQL, etc)
- Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners.
- Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).
Benefits:
- Health Plans
- Mental Health support
- 401(k) Retirement Plan with employer match
- Stock Option Program
- Disability Programs
- Health Savings and Flexible Spending Accounts
- Family-forming benefits
- Life and Serious Injury Benefits
- Paid leave of absence programs
- 35 days annually for paid time off
Apply tot his job Apply To this Job