Site Reliability Engineer Resume

NC

SUMMARY

  • Around 6 years of experience as a DevOps/SRE/Docker/EKS/Terraform/Ansible/Build and Release Engineer.
  • Good experience executing DevOps strategy in various Linux and Windows server environments, along with adopting cloud strategies based on AWS.
  • Knowledge of and experience with cloud technologies, including Infrastructure as a Service, Platform as a Service, and Software as a Service (IaaS, PaaS, and SaaS).
  • Experience optimizing volumes and EC2 instances, creating multiple VPCs, and creating alarms and notifications for EC2 instances using CloudWatch.
  • Strong hands-on experience with Amazon Web Services such as EC2, ECS, S3, Elastic Beanstalk, Elastic Load Balancing, Auto Scaling, RightScale, RDS, VPC, Route 53, CloudWatch, CloudFormation and IAM.
  • Professional experience configuring and deploying instances on AWS and in on-premises data centers.
  • Build customized Amazon Machine Images (AMI), deploy AMIs to multiple regions and launch EC2 instances using these custom images.
  • Experience with the OpenShift platform in managing Kubernetes deployments.
  • Strong understanding of infrastructure automation tools (Terraform, CloudFormation templates)
  • Experience setting up and maintaining auto-scaling AWS stacks using Terraform and scripting
  • Expertise in Terraform for building, changing and versioning infrastructure
  • Experience implementing Capability deployment using Terraform scripts
  • Experience with Chef InSpec for validating Kubernetes cluster deployments on the OpenShift and GKE platforms.
  • Worked with Apache Kafka for high throughput in both publishing and subscribing, with disk structures that provide constant performance even with many terabytes of stored messages.
  • Used Apache Spark for rapid processing of large data volumes.
  • Extensive experience in configuration and deployment automation with application servers such as Oracle WebLogic, WebSphere and JBoss, web servers such as Apache Tomcat, and more modern web containers such as Nginx.
  • Experience in building multi-tier, highly available, fault-tolerant and scalable applications using AWS Elastic Beanstalk, Amazon RDS, Elastic Load Balancing and Auto Scaling.
  • Experience working with Docker, Kubernetes, Docker Swarm and microservices
  • Created CI/CD pipeline for Docker build and deployment
  • Experience engineering the Continuous Delivery automation processes for container technology operated on Docker Swarm
  • Experience migrating from Docker Swarm to Kubernetes, setting up clusters on AWS
  • Experience building Spring Boot RESTful API back-end applications and Node.js front-end applications
  • Set up Docker Swarm and Kubernetes clusters for Docker container management
  • Built and deployed batch processing microservices
  • Experience in creating SCM Process, policies, procedures, and Release Notes to follow Development and allow QA teams to Release and/or deploy the highest quality product for the enterprise.
  • DevOps expert in release and deployment management activities and procedures across various system environments.
  • Extensive experience with SCM and Change/Incident Management.
  • Extensively worked on Jenkins, Bamboo and Build Forge for continuous integration and for end-to-end automation of all builds and deployments.
  • Extensively involved in writing the cookbooks and recipes.
  • Good knowledge of and experience with Bash shell and Python scripting
  • Deployed to AWS and Google Cloud with Ansible and Terraform, using Terraform to build, update and version infrastructure safely and efficiently, and creating custom in-house solutions with Ansible configurations.
  • Experience in designing, installing and implementing the Ansible configuration management system, writing Ansible playbooks and deploying applications.
  • Hands-on experience with Ansible and Ansible Tower as configuration management tools to automate repetitive tasks, quickly deploy critical applications, and proactively manage change.
  • Applied DevOps practices using AWS, Elastic Beanstalk, and Docker with Kubernetes
  • Expertise in SCM tools such as SVN, Git and Perforce.
  • Experience with application servers such as BEA WebLogic 8.1, WebSphere 5.1, Tomcat 6.0 and JBoss AS 4.0, using various IDEs such as WebSphere Studio Application Developer (WSAD 5.0), Eclipse 2.0/3.0, NetBeans 6.1, and JDeveloper.
  • Hands-on knowledge of Java development in front-end, middle-tier and back-end.
  • Experience in building multithreaded, high-availability, high-performance applications
  • Highly skilled in managing source code and maintaining distributed version control systems (Git) as well as centralized VCS such as SVN and ClearCase, including branching, tagging and maintaining versions across environments.
  • Extensive experience in bash scripting and Python for automation.
  • Expertise in setting up and configuring the WebSphere, Tomcat, JBoss and WebLogic application servers, and involved in deployment activities on them
  • Expertise in automating applications using the Jenkins continuous integration tool.
  • Extensively worked on Jenkins
  • Experience running Jenkins on top of Kubernetes to drive all automation with webhooks tied into our Git system
  • Knowledge of the continuous integration tool Jenkins for end-to-end automation of all builds and deployments.
  • Experience with container-based deployments using Docker, working with Docker images, Docker Hub and Docker registries, and installing, configuring and clustering Kubernetes.
  • Producing detailed release plans, controlling release scope and monitoring milestones.
  • Coordinating activities with appropriate teams to meet release and project schedules.
  • Strong knowledge of source control concepts such as branches, merges and tags.
  • Managed environments DEV, SIT, QA, UAT and PROD for various releases and designed instance strategies.

TECHNICAL SKILLS

Cloud Technologies: AWS (EC2, S3, ELB, Auto Scaling, Elastic Beanstalk, CloudFront, CloudFormation, RDS, DMS, VPC, Route 53, CloudWatch, CloudTrail, IAM, SNS, ECS, EKS, etc.)

Version Control Tool: Git, GitHub, GitLab, Bitbucket, ClearCase, SVN

CI/CD Tool: Jenkins/CloudBees, CircleCI

CM Tools: Ansible, SaltStack

Infrastructure Provisioning Tool: CFT, Terraform Template

Build Tool: Maven & Gradle

Containerization Tool: Docker

Container Management Tool: Docker Swarm, Kubernetes, AWS ECS and AWS EKS

Monitoring Tool: Nagios and Splunk

SOA/Web Services: SOAP, REST, Microservices Architecture

Application/Web Servers: Nginx, Tomcat Apache, WebLogic and WebSphere

Scripting Languages: Python, Bash Shell Scripting

PROFESSIONAL EXPERIENCE

Confidential, NC

Site Reliability Engineer

Responsibilities:

  • Owned the complete lifecycle of multiple application stack back-end services from platform setup, configuration, deployment support, CI/CD setup to monitoring, performing capacity planning, system performance optimization, delivery of SLAs, 24/7 support and development of system documentation
  • Increased the reliability of operations by developing automated solutions, troubleshooting networking, deployment and application issues
  • Extensively involved in SRE monitoring and automation delivery teams.
  • Delivery and implementation of SRE analytics, log management, monitoring and automation solutions. Managing SLAs, project resource and delivery management.
  • Extensive involvement in Monitoring & Automation architecture design, engineering, implementation and delivery. Implementation and delivery of SRE tools in CI/CD pipeline utilizing GitHub, Jenkins, Puppet, JIRA, Confluence, ServiceNow and other tools.
  • Delivering reliability for business-critical SecOps applications, including their underlying infrastructure (hardware, virtual, Unix/Linux, Windows, storage, Oracle, Sybase, SQL, MySQL, networks, authentication services, etc.)
  • Developed a stream filtering system using Spark streaming on top of Apache Kafka.
  • Used Flume, Kafka to aggregate log data into HDFS.
  • Gathering requirements for RFPs and POCs of monitoring & log management tools. Onboarding and delivering newer tools, tools consolidations and optimizing existing landscape to meet organizational technology goals.
  • Also worked on Apache Hadoop, using Kafka as the messaging system and Spark for processing large data sets.
  • Worked on Apache Mesos with Marathon for resource management.
  • Responsible for installing, setting up and configuring Apache Kafka and Apache ZooKeeper.
  • Ownership of SRE monitoring and analytics platform as a service. Leading Incident & problem management during platform outages and critical disruptions.
  • Manage underlying platform infrastructure components (Linux/Windows/Storage/Networks/Load balancer/Security)
  • Migrated 9 microservices to Google Cloud Platform from Skava, with one more major release planned for 4 more microservices.
  • Working on migrating the mobile application from Skava to Google Cloud by breaking the code into microservices.
  • Extensive knowledge of and hands-on experience implementing PaaS, IaaS and SaaS delivery models inside the enterprise (data center) and in public clouds, using AWS, Google Cloud, Kubernetes, etc.
  • Set up alerting and monitoring using Stackdriver in GCP.
  • Daily monitoring of alert notifications and providing resolutions
  • Follow a Change Management process from task creation to completion, providing review, detail validation and execution of all tasks
  • Tuning the Application stack to improve stability and resultant uptime metrics
  • Assisted in Acute and Recurring issue investigation and resolution
  • Involved in Performance Trend Analysis, Log Analysis and Error resolution, Tuning of Platform Monitoring and Root Cause Analysis
  • Designed robust and resilient apps with high scalability and high availability
  • Created alert and monitoring system for business critical applications
  • Participate in a 24x7 on-call rotation outside business hours in support of our platform and customers
  • Coordinate with team to maintain Service Level Agreements (SLAs)
  • Able to diagnose and address issues in a timely and effective manner to the satisfaction of the client
  • Assist Managed Services Technical Writers in creation of Runbooks and other solution documentation
  • Managed network security using Load balancer, Auto-scaling, Security groups and NACL.
  • Experience in Python scripting for multithreaded automation and in shell scripting for build automation
  • Engineered Splunk to build, configure and maintain heterogeneous environments and maintained log analysis generated by various systems including security products.
  • Created route tables to route traffic to the appropriate public or private network.
  • Used bootstrap scripts to launch large numbers of instances in AWS.
  • Utilized the AWS CLI to automate backups of ephemeral data stores to S3 buckets and EBS, and to create nightly AMIs of mission-critical production servers as backups.
  • Used Ansible for configuration management of hosted instances within AWS. Configured and networked Virtual Private Clouds (VPCs).
  • Wrote Ansible playbooks with Python SSH as the wrapper to manage configurations of OpenStack nodes, and tested playbooks on AWS instances using Python.
  • Implemented Capability deployment using Terraform scripts
  • Wrote Terraform scripts and complex parser files as required
  • Maintained repository management tools such as Artifactory to store the WAR and JAR files, which were deployed using Ansible from Jenkins.
  • Created Kubernetes YAML files for deployments in the Kubernetes cluster
  • Set up a Kubernetes cluster with a Kubernetes master that has the API server, etcd, kube-scheduler and kube-controller-manager.
  • Set up Kubernetes kubelets that talk to the API server on the Kubernetes master
  • Created Pods, Deployments, Services and Replication Controllers in Kubernetes
  • Used Spinnaker for Kubernetes continuous deployment and rolling updates
  • Experienced in using Docker Swarm and deployed Spring Boot applications.
  • Developed container support for cloud environments and Docker. Deployed the applications in Docker containers and managed the containers at enterprise level.
  • Created Docker images using a Dockerfile, worked on Docker container snapshots, removing images and managing Docker volumes.
  • Configured Docker container for branching purposes and Implemented Docker automation solution for Continuous Integration / Continuous Delivery model.
  • Created users, roles and groups using Identity and Access Management (IAM) and attached the required policies.
  • Scripted administration tasks using the CLI, Bash and Python. Built upstream and downstream jobs in Jenkins to build and deploy onto different environments.
  • Performed branching, tagging and release activities on the version control tool Git (GitHub).
  • Built and deployed Java source code into application servers in an Agile continuous integration environment.
  • Provisioned load balancer, auto-scaling group and launch configuration for microservices using Ansible
  • Built and maintained Docker infrastructure for service-oriented architecture (SOA) applications.
  • Managed monitoring using Nagios and updated parameters with active and passive checks.

Environment: Linux (Red Hat/CentOS), Windows, AWS, Ansible, Docker, Kubernetes, WebLogic, JBoss, Oracle, MySQL, Maven, CVS, Git/Bitbucket, SVN, Jenkins/CloudBees, Jira

Confidential, AUSTIN, TX

AWS DevOps/Docker/EKS/SRE Engineer

Responsibilities:

  • Building the AWS Infrastructure using VPC, EC2, S3, Route 53, EBS, Security Group, Auto Scaling, and RDS in CloudFormation.
  • Handled storage over cloud with EBS and S3 policies, performed capacity planning and designing, OS upgrades and hardware refresh.
  • Installed and set up ProLiant BL and DL series blade servers with Fibre Channel connectivity and the Cisco GESM interconnect switch; configured the connectivity; set up a JBoss cluster and an Oracle 10g cluster; configured load balancing on Apache on Red Hat Linux 4.0/5.0; and set up NIS/NFS on Red Hat servers
  • Worked on the Analytics Infrastructure team to develop a stream filtering system on top of Apache Kafka
  • Implemented a distributed messaging queue to integrate with Cassandra using Apache Kafka and ZooKeeper.
  • Used Flume, Kafka to aggregate log data into HDFS.
  • Integrated Kafka with Flume in a sandbox environment using Kafka source and Kafka sink.
  • Worked with application teams to install operating system and Hadoop updates, patches and version upgrades as required. Integrated Kafka with Spark in a sandbox environment.
  • Responsible for installing, setting up and configuring Apache Kafka and Apache ZooKeeper.
  • End-to-end deployment ownership for projects on AWS. This includes Python scripting for automation, scalability, build promotions from staging to production, etc.
  • Tested high availability of AWS-RDS-MySQL database by simulating failover from the primary database to read replicas and hence promoting read replica to the primary.
  • Managing the OpenShift cluster, including scaling the app nodes up and down.
  • Designing and implementing CI (Continuous Integration) system: configuring Jenkins servers, Jenkins nodes, creating required scripts (Python), and creating/configuring VMs (Windows/Linux).
  • Automated build and package jobs with Groovy scripts using the Jenkins Job DSL
  • Experienced in setting up Jenkins pipeline jobs for build configurations using Groovy.
  • Worked with the OpenShift platform in managing Docker containers and Kubernetes clusters
  • Designed Terraform templates to create custom sized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates.
  • Had very strong exposure to using Ansible automation to replace the different components of OpenShift, such as etcd, master, app and infra nodes.
  • Implemented Capability deployment using Terraform scripts
  • Created AWS resources with Terraform and integrated them with the CI/CD process.
  • Orchestrated and migrated CI/CD processes using Terraform templates and containerized the infrastructure using Docker for setup in AWS VPCs.
  • Built and deployed releases using Jenkins, Git, OpenShift, XL Deploy and XL Release.
  • Created Terraform modules to create instances in AWS and automated the creation of AWS resources using Terraform.
  • Created highly available services to achieve the Continuous Delivery goal in a highly scalable environment, using Docker coupled with the load-balancing tool Nginx
  • Created Blue/Green deployments with Terraform to manage application releases.
  • Enhanced the core product written in Go (Golang) and fixed functional and performance-related bugs in it.
  • Worked extensively with Jenkins as the continuous integration tool for Go (Golang) projects.
  • Used Go to run the Docker command-line interface tools
  • Engineered the Continuous Delivery automation processes for container technology operated on Docker Swarm
  • Implemented the Docker Swarm cluster and the Kubernetes cluster.
  • Containerized the applications using Docker for the test and dev environments and leveraged Docker Swarm to maintain and monitor the nodes running the Docker applications.
  • Migrated from Docker Swarm to Kubernetes, setting up clusters on AWS
  • Wrote Python scripts for pushing data from MongoDB to a MySQL database.
  • Worked on Kubernetes and Docker Swarm tools to deploy containers in a cluster.
  • Used Docker to containerize microservices in the cloud and managed clusters using Docker Swarm and Kubernetes. Developed a private cloud system solution on CoreOS using Kubernetes.
  • Experience using Golang to deploy centrally to a cloud file system and then have the app servers download the new software and redeploy
  • Created and configured Ansible playbooks to automatically install packages from a repository, change the configuration of remote machines and deploy new builds, and set them up in Ansible Tower so that other users can run them at the push of a button.
  • Used Vault feature to encrypt any structured data file by using Ansible.
  • Also encrypted arbitrary files, binary files using Ansible
  • Worked with Ansible and Ansible Tower as configuration management tools to automate repetitive tasks and to deploy applications.
  • Demonstrated how Ansible, together with Ansible Tower, can be used to automate different software
  • Coordinating Release effort amongst various teams (Development, QA, performance Testing and Production) in geographically separated environment.
  • Prepared test cases and strategies for unit testing using JUnit
  • Wrote JUnit test cases for unit, integration, and functional tests, run automatically by Hudson/Jenkins in the builds triggered by each push to Git.
  • Experience working with Docker Hub, creating Docker images and handling multiple images, primarily for middleware installations and domain configurations
  • Launched and debugged Docker containers, installed and set up Docker hosts in the environment, and integrated Docker and Mesos to monitor container status.
  • Used Maven as a build tool on Java projects to produce build artifacts from the source code.
  • Wrote Ansible playbooks with Python SSH as the wrapper to manage configurations of OpenStack nodes, and tested playbooks on AWS instances using Python.
  • Generated and maintained the project SCM plan, created hooks and managed permissions on the branches for Git.
  • Deployed the application, a REST API-based microservice. It has 35 services, all of them REST endpoints. I also wrote scripts to get internal data from a different application (one that I did not maintain).
  • Used Elasticsearch as an open-source search engine with a REST API; Kibana has been used for web UI queries.
  • Used REST APIs over HTTP and HTTPS, and WS or WSS for WebSocket APIs.
  • Using Python and Bash scripting and the Bitbucket API, created a service called Resource scrapper that captures the resource requests and limits usage for all containers.
  • Constructed a RESTful API to manage infrastructure services using Python.

Environment: AWS, Terraform, Route 53, Auto Scaling, Java/J2EE, Ansible, Docker, Jenkins, Maven, Apache, Tomcat, Selenium, Bash Shell, Git, Python, JIRA, Nagios, Confluence, MongoDB.

Confidential, Berkeley Heights, NJ

Linux System Engineer

Responsibilities:

  • Performed multiplatform volume management using SVM, LVM, ZFS, NAS, and VERITAS volume manager.
  • Performed regular function checks on all RHEL 5.5 Linux Servers in both environments (Production/Non-Production and/or Production/Development) especially on Apache servers
  • Red Hat Linux server administration, including security patching, kernel tuning, RAID disk array management, and software packaging and installation using RPMs
  • Troubleshoot build, install, configuration, performance, monitoring and other process issues of the servers and the selected application environments.
  • Created change records, walked through tasks from construction to validation and approval, and deployed application enhancements in development and production environments.
  • Integrated Maven with Perforce to manage and deploy project related tags.
  • Installed and administered the Tomcat repository to deploy the artifacts generated by Maven and to store the dependent JARs used during the build.
  • Mentored Business Areas on Subversion Branching, Merging strategies concepts.
  • Resolved update, merge and password authentication issues in Jenkins and Jira.
  • Involved in implementing Tool Suite (Jira, Jenkins).
  • Set up continuous integration and formal builds using Jenkins with Maven and the Tomcat repository.
  • Partially involved in deploying WARs/EARs (back end) through the WebLogic Application Server console.
  • Performed setup of clustered environment with WebLogic Application Server.
  • Used Shell script to configure the WebLogic application server in all the environments
  • Supported lead developers with configuration management issues.
  • Worked on creating the Software Configuration Management Plan.
  • Managed all the bugs and changes into a production environment using the Jira tracking tool
  • Installation of patches and packages using RPM and YUM in Red Hat Linux.
  • Installed and configured SAMBA server for Windows and Linux connectivity.
  • Installed and configured Apache / Tomcat web server.

Environment: AWS, Git, Jenkins, Ansible, Maven, Ant, Python, Ruby, JIRA, Shell Scripts, XML, UNIX, Linux, Perl, Mod-Perl, Tomcat, PowerShell, Docker, Kubernetes, Nagios and Oracle.
