hero

AEG Job Board

Discover career opportunities in the AEG Sponsor network

Lead Site Reliability Engineer

Regeneron

Regeneron

Software Engineering
Tarrytown, NY, USA
Posted on Nov 21, 2024

The Genome Informatics and Data Engineering team is looking for a passionate Lead Site Reliability Engineer with strong technical ability, communication, and collaboration skills. As passionate Lead Site Reliability Engineer, you will be responsible for managing AWS platform, implementation & management of solutions on AWS, from the ground up, to deliver highly scalable services. You will work with stakeholders, rest of the RGC IT team, and subject matter experts to build and implement cloud platform solutions.

Key focus areas include:

  • AWS Platform Administration.
  • Maintain security and compliance posture of workloads.
  • Create/Manage serverless and containerized applications.
  • Create/Manage core AWS services resources such as EC2, ECS, EFS, S3, IAM, CloudWatch ,Lambda, RDS, Redshift.
  • Develop Infrastructure as Code (IaC) templates leveraging Cloud Formation.
  • Develop Python and bash scripts for automating routine tasks.
  • Monitor system performance and maintain system health.
  • Resolve the support requests and incidents.

Good to have:

  • Experience in Database operations.
  • Understanding of Data & Analytics.
  • Knowledge on Slurm.
  • Knowledge on WDL.
  • Knowledge on distributed computing and MPI based applications.
  • Knowledge on Application Development & API management.
  • Knowledge on Kubernetes.

In this role, a typical day might include the following:

  • Provide day to day operational support and perform systems administration tasks
  • Apply industry standards and best practices to ensure system and application security
  • Drive automation of operations and management of infrastructure as code
  • Deploy and manage monitoring metrics and logging capabilities on RGC’s systems & Applications
  • Containerize various applications and tools and implement container orchestration
  • Develop SOPs and configure cloud services to stand up high compute pipelines
  • Installs, configures, and maintains cloud services & applications on cloud platform
  • Keep abreast of the latest advances in the cloud platforms and services
  • Provide end user support, training, and documentation.

This job might be for you if:

  • You have an eye for detail and pride yourself on the quality of your work. Operational excellence matters more than just finishing the tasks.
  • You thrive in a fast-paced environment and enables us to deliver and improve on the product quickly.
  • With your sleeves rolled up, you work on current problems while thinking of future solutions.

To be considered for the Lead Site Reliability Engineer position, you must have 8+ years of in-depth experience in AWS and Linux-based operating systems. Expertise in implementing containers and container orchestration using Docker, ECS, ECR, EKS, Fargate, etc. Experience in developing Python, bash scripts, and cloud formation templates. Hands-on experience in Linux administration, CI/CD pipeline development, and core AWS services such as EC2, ECS, S3, ALB, EFS, RDS, CloudWatch, Lambda, and IAM. Knowledge of database operations and monitoring tools like Splunk, Grafana, and Prometheus. AWS and other related certifications are a plus. Level will be commensurate with experience and qualifications. Must be onsite 3 days a week.

#RGCGIDE
#LI-hybrid

#RGCGI

Does this sound like you? Apply now to take your first step towards living the Regeneron Way! We have an inclusive and diverse culture that provides comprehensive benefits, which often include (depending on location) health and wellness programs, fitness centers, equity awards, annual bonuses, and paid time off for eligible employees at all levels!

Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or parental status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. The Company will also provide reasonable accommodation to the known disabilities or chronic illnesses of an otherwise qualified applicant for employment, unless the accommodation would impose undue hardship on the operation of the Company's business.

For roles in which the hired candidate will be working in the U.S., the salary ranges provided are shown in accordance with U.S. law and apply to U.S.-based positions. For roles which will be based in Japan and/or Canada, the salary ranges are shown in accordance with the applicable local law and currency. If you are outside the U.S, Japan or Canada, please speak with your recruiter about salaries and benefits in your location.

Please note that certain background checks will form part of the recruitment process. Background checks will be conducted in accordance with the law of the country where the position is based, including the type of background checks conducted. The purpose of carrying out such checks is for Regeneron to verify certain information regarding a candidate prior to the commencement of employment such as identity, right to work, educational qualifications etc.

Salary Range (annually)

$126,700.00 - $206,900.00