Surendra Kumar

Site Reliability Engineer / DevOps Professional
📍 Hyderabad, India

Professional Summary

Site Reliability Engineer and DevOps professional with 10+ years of industry experience spanning cloud and on-premises infrastructure, automation, and system reliability. Proven expertise in designing and scaling AWS environments with Terraform, managing hybrid data centers, and implementing CI/CD pipelines to accelerate delivery. Skilled in containerization with Docker, troubleshooting complex Linux/Windows systems, and driving operational excellence through proactive monitoring, load balancing, and auto scaling. Adept at collaborating across teams to build resilient, high-performance systems in diverse industries including telecom, technology, and financial services.

Experience

Site Reliability Engineer
May 2024 – Present
Capgemini · Hyderabad
  • Working in the financial services and insurance sector, ensuring high availability and reliability of critical applications.
  • Implementing monitoring, alerting, and incident response frameworks to minimize downtime and improve system resilience.
  • Automating infrastructure provisioning and scaling using Terraform, Docker, and CI/CD pipelines.
  • Collaborating with development and operations teams to embed SRE best practices into workflows.
  • Conducting root cause analysis and performance tuning to enhance system efficiency.
  • Supporting hybrid environments, integrating cloud and on-prem infrastructure for seamless operations.
DevOps Engineer
Jan 2022 – Oct 2023
Cyient · Hyderabad
  • Designed and maintained CI/CD pipelines to streamline application deployment and reduce release cycles.
  • Automated infrastructure provisioning using Terraform and configuration management tools.
  • Deployed and managed Dockerized applications, improving scalability and portability.
  • Implemented monitoring and alerting systems to enhance reliability and reduce downtime.
  • Collaborated with cross-functional teams to integrate DevOps practices into development workflows.
  • Troubleshot complex Linux/Windows issues, ensuring smooth operations across hybrid environments.
JC Network Ops
May 2018 – Jan 2022
Reliance Jio · Bokaro Steel City
  • Designed and implemented new network infrastructure, ensuring scalability and reliability.
  • Executed integration of network components into existing systems, enabling seamless service delivery.
  • Coordinated end-to-end deployment activities, from design to "ready to use" rollout.
  • Conducted testing, validation, and documentation of installations for compliance and operational readiness.
  • Collaborated with cross-functional teams to optimize deployment timelines and minimize service disruption.
Field Operation Engineer
Feb 2016 – May 2018
IMSI Global / Evolve Technologies and Services · West Bengal
  • Performed maintenance and upgrades of telecom infrastructure, ensuring network stability and service continuity.
  • Coordinated with regional teams to execute infrastructure upgrades with minimal downtime.
  • Conducted preventive maintenance and troubleshooting to reduce recurring faults.
  • Supported rollout of new infrastructure components, aligning with operational standards.
  • Documented maintenance activities and upgrade reports for compliance and audit purposes.
Project Engineer
Jan 2015 – Dec 2015
Accord Synergy · Patna
  • Led installation and commissioning of new infrastructure for telecom and network systems.
  • Coordinated with cross-functional teams to ensure timely project delivery and compliance with technical standards.
  • Conducted site inspections, troubleshooting, and quality assurance during deployment phases.
  • Documented installation procedures and prepared handover reports for client validation.

Skills

Cloud Infrastructure
AWS, Terraform, EC2 Auto Scaling, Load Balancers
On-Prem Infrastructure
VMware, Hyper-V, Bare-metal servers
DevOps & Automation
Docker, Jenkins, Git CI/CD Workflows, Udeploy
System Administration
Linux Troubleshooting, Windows, Cron Jobs
Monitoring Tools
Splunk, Dynatrace
Reliability Engineering
Monitoring, Incident Response, Root Cause Analysis
Communication
Stakeholder Collaboration, Backend Verification

Education

Bachelor of Engineering, EEE
2010 – 2014
Don Bosco Institute of Technology (DBIT) · Bengaluru

Certifications

Azure AI Fundamentals · Sep 2025
Azure Fundamentals · May 2025
AWS Certified Cloud Practitioner · Dec 2024