Site Reliability Engineering Specialist
Company: Disability Solutions
Location: Augusta
Posted on: November 18, 2024
Job Description:
Position Type: Full time
Type Of Hire: Experienced (relevant combo of work and education)
Travel Percentage: 1 - 5%
We are FIS. Our technology powers the world's economy
and our teams bring innovation to life. We champion diversity to
deliver the best products and solutions for our colleagues, clients
and communities. If you're ready to start learning, growing and
making an impact with a career in fintech, we'd like to know: Are
you FIS?
Role and Responsibilities
Reporting to the Head of Cloud
Enablement Engineering, the Site Reliability Engineer will play a
critical role in driving innovation and growth for the Banking
Solutions business. In this role, the candidate will have the
opportunity to make a lasting impact on the company's digital
transformation journey, drive customer-centric innovation and
automation, and position the organization as a leader in the
competitive digital banking landscape. Specifically, the Site
Reliability Engineer will be responsible for the following:
- Design and maintain monitoring solutions and alerting
mechanisms for infrastructure, application performance, and user
experience metrics, enabling proactive issue detection and
mitigation.
- Implement automation tools and processes to automate routine
tasks, scale infrastructure, and ensure seamless deployments,
updates, and rollbacks with minimal user impact.
- Ensure the reliability, availability, and performance of
applications and services, focusing on minimizing downtime,
optimizing response times, and maintaining high availability for
users.
- Lead incident response efforts, including identification,
triage, resolution, and post-incident analysis to prevent
recurrence and improve system resilience.
- Conduct capacity planning, performance tuning, and resource
optimization for environments, collaborating with development and
operations teams to meet scalability and performance goals.
- Collaborate with security teams to implement security best
practices, perform vulnerability assessments, and ensure compliance
with security standards and regulatory requirements for
applications.
- Manage deployment pipelines, release processes, and
configuration management for app deployments, ensuring consistency,
reliability, and version control across environments.
- Identify areas for improvement in reliability, performance, and
efficiency through data analysis, root cause analysis, and trend
analysis, and drive initiatives to enhance system reliability and
operational efficiency.
- Create and maintain documentation, runbooks, and knowledge base
articles for operational procedures, troubleshooting guides, and
best practices, and promote knowledge sharing within the team.
- Develop and test disaster recovery plans, backup strategies,
and failover mechanisms for app services, ensuring business
continuity and data integrity in case of failures or disasters.
- Collaborate with development, QA, DevOps, and product teams to
ensure alignment on reliability goals, performance metrics, release
schedules, and incident response processes.
- Participate in on-call rotations and provide 24/7 support for
critical incidents, troubleshoot issues, and coordinate with teams
for resolution, escalation, and follow-up actions as per defined
SLAs.
Professional Qualifications
- Proficient in development technologies, architectures, and
platforms (web, API) to understand system complexities and
performance considerations.
- Experience in cloud platforms (e.g., AWS, Azure, Google Cloud)
and infrastructure as code (IaC) tools for managing app
infrastructure and deployments.
- Knowledge of monitoring tools (e.g., Prometheus, Grafana,
DataDog, New Relic) and logging frameworks (e.g., Splunk,
SumoLogic, ELK Stack) for real-time visibility into system health,
performance metrics, and user experience.
- Experience in incident management, including incident response,
triage, root cause analysis (RCA), and post-mortem reviews to
prevent recurring issues.
- Strong troubleshooting skills to diagnose complex technical
issues in app environments, infrastructure, networking, and
performance bottlenecks.
- Proficiency in scripting languages (e.g., Python, Bash) and
automation tools (e.g., Terraform, Ansible) for automating routine
tasks, deployments, and infrastructure management.
- Experience in implementing continuous integration/continuous
deployment (CI/CD) pipelines for apps using tools like Jenkins,
GitLab CI/CD, or Azure DevOps.
- Expertise in setting up monitoring solutions, configuring
alerts, and creating dashboards to monitor system performance,
application metrics, and user experience.
- Familiarity with APM (Application Performance Monitoring)
tools to analyze app performance, identify bottlenecks, and
optimize resource utilization.
- Familiarity with RUM (Real User Monitoring) for tracking and
analyzing user interaction and system performance.
- Commitment to continuous learning, staying updated with
industry trends, new technologies, and best practices in app
reliability, performance, and operations.
- Adaptability to evolving requirements, technologies, and
business needs, with a focus on driving continuous improvement and
operational excellence.
Personal Characteristics
- Demonstrates judgment and flexibility; thinks about issues and
develops solutions that thoughtfully take the broader context into
account; deals positively with shifting demands on time and
priorities and with rapidly changing environments.
- Takes an ownership approach to engineering and product
outcomes.
- Action-oriented self-starter who can set strategy and drive
execution with a "roll up the sleeves" approach.
- Excellent interpersonal communication, negotiation and
influencing skills to work effectively with all stakeholders
(internal & external), making information-based decisions.
- Penchant for excellence, both personally and professionally,
demonstrated by intellectual curiosity, record of accomplishment,
and reputation; shows strong attention to detail and implementation
of best practices with an inclination for continuous improvement.
- Ability to quickly establish strong credibility with employees,
business partners and external resources.
- Embodies and delivers the firm's values and culture towards
colleagues, clients, and communities:
- Win as one team
- Lead with integrity
- Be the change
FIS is committed to providing its employees with
an exciting career opportunity and competitive compensation. The
pay range for this full-time position is $131,290.00 - $220,600.00
and reflects the minimum and maximum target for new hire salaries
for this position based on the posted role, level, and location.
Within the range, actual individual starting pay is determined by
additional factors, including job-related skills, experience, and
relevant education or training. Any changes in work location will
also impact actual individual starting pay. Please consult with
your recruiter about the specific salary range for your preferred
location during the hiring process.
Privacy Statement
FIS is
committed to protecting the privacy and security of all personal
information that we process in order to provide services to our
clients. For specific information on how FIS protects personal
information online, please see the Online Privacy Notice.
EEOC Statement
FIS is an equal opportunity employer. We evaluate
qualified applicants without regard to race, color, religion, sex,
sexual orientation, gender identity, marital status, genetic
information, national origin, disability, veteran status, and other
protected characteristics. The EEO is the Law poster is available
here; the supplement document is available here.
For positions located in the
US, the following conditions apply. If you are made a conditional
offer of employment, you will be required to undergo a drug test.
ADA Disclaimer: In developing this job description, care was taken
to include all competencies needed to successfully perform in this
position. However, for Americans with Disabilities Act (ADA)
purposes, the essential functions of the job may or may not have
been described for purposes of ADA reasonable accommodation. All
reasonable accommodation requests will be reviewed and evaluated on
a case-by-case basis.
Sourcing Model
Recruitment at FIS works
primarily on a direct sourcing model; a relatively small portion of
our hiring is through recruitment agencies. FIS does not accept
resumes from recruitment agencies which are not on the preferred
supplier list and is not responsible for any related fees for
resumes submitted to job postings, our employees, or any other part
of our company.
Keywords: Disability Solutions, Augusta, Site Reliability Engineering Specialist, Engineering, Augusta, Georgia