Lead Cloud Operations Engineer/devops Resume
SUMMARY:
- Lead Cloud Engineer; 10+ years design, deployment and migration of Linux OS and Cloud Infrastructure technologies to include VMware, Terremark and AWS Cloud.
- Design and deploy web applications utilizing AWS stack, including VPC, EC2, ELB, Security Groups, NACL, NAT Gateway, S3, RDS, Dynamo, IAM and CloudFormation.
- Build Virtual Private Cloud (VPC), as well as migrating from one VPC to another.
- Migrate on premise infrastructure, platform, software into AWS Cloud and VMware.
- Lead successful development, deployment and operations of Cloud Infrastructure for the Federal Government and Global Clients within timeframe with minimal to no downtime.
- Technical liaison, report directly to Director of IT solutions and services at customer site.
- Lead development of strategic IT roadmap; analyze & design IT infrastructure solutions.
TECHNICAL SKILLS:
Operating System: Red Hat Linux, SUSE Linux (SLES), Oracle Enterprise Linux (OEL), Centos 4/5, Sun Solaris 9/10, Microsoft Windows Server 2010 R2/2008
Cloud/Virtualization: Amazon Web Services (AWS), VMware, vSphere, vCenter, Terremark/Verizon Cloud, CGI cloud, Monsoon, Cisco UCS, DS3 Data Vault(Backup/Restoration tool)
Automation: Chef, Puppet, Jenkins
Version Control: GitHub, Mecurial
Web Applications: Tomcat, SAPJVM, Tomcat/JBoss, Apache, IIS, BOXI3.1, Confidential HCM, Crystal Report, Wordpress, Percussion, iContent, Drupal6/7, Jira, Confluence, Zabbix, Splunk, Sitescope, Nagios, Veritas Netbackup
Network/Security: AWS (ELB, Route 53, Security Group, ENI, EIP, CloudFront), DNS, F5TCP/IP, PKI, SSL Certificates, SFTP, SSH, DNS, Akamai Edge Con
Storage: AWS S3, AWS Glacier, NetApp, EMC (SAN)
Database: AWS (RDS, Dynamo DB, Elastic Cache, Redshift), Oracle 11g and earlier
MySQL 5.7 and earlier version, Microsoft SQL server
Languages: Shell Scripts, AWS CLI, Perl, Python, Ruby, Java, XML, HTML, PL/SQL
Design Tool: Draw.io, Microsoft Visio
Hardware: NETAPPFAS3100, IronPort(M650/C350), CiscoSwitches3560, SRW2086, Sonicwall, UPS(APC/Tripp Lite), DigiView Rack, CAT5 Cable, SPARC Enterprise M3000, M4000, SunFireV245, V480, V490,V880, V210(Blade Server), Sun E3500, E4500, T5240, HP ProLiant DL380 servers, Dell PowerEdge 1850, 2900, 2950
PROFESSIONAL EXPERIENCE:
Confidential
Lead Cloud Operations Engineer/DevOps
Environment: AWS(Full Stack), Hybrid Cloud, Physical Servers, Cluster(HA), VMware(ESXi5.5), Red Hat Linux, SUSE Linux(SLE), Oracle Enterprise Linux, Microsoft Windows Server 2010/2008R2, CISCO UCS, Apache, IIS, Tomcat/JBoss, F5, BOXI3.1, Crystal/BIRT Report, Confidential Successfactors(HCM), Oracle, Microsoft SQL Server, MySQL, Active Directory(DNS), Chef, Puppet, Zabbix, Splunk, Sitescope, Manage Engine, Ironport(Sendmail), Jira, Confluence, SAN, SFTP(Globalscape), Akamai
Responsibilities:
- Lead complex troubleshooting issues
- Migrated on - premise infrastructure, platform, software into AWS Cloud and VMware
- Designing and deploying web applications utilizing almost all of the AWS stack ( including VPC, EC2, ELB, Security Groups, NACL, NAT Gateway, S3, RDS, Dynamo, IAM, CloudFormation, CloudFront) focusing on high-availability, fault tolerance, and auto-scaling
- Working with management in making key decisions on AWS Instance Type to be used for better economies of scale
- Working with Chef Automation tool to manage configuration and deployment of EC2 and VMs. As well as Continuous Integration and Delivery.
- Build chef server, workstation and bootstrap nodes to communicate with chef server.
- Build Version Control Server - GitHub private repositories
- Create S3 backups using versioning enable, lifecycle and moved objects to Amazon Glacier for archiving purpose.
- Create users, groups and roles using AWS Identity Access Management (IAM) and assigned individual policies to each group.
- Use AWS CloudTrail for audit findings and Cloud Watch for monitoring AWS resources
- Building ESXi Host Servers, Virtual Machines and Physical Database servers for Cloud environment
- Manage Storage Usage and ensure High Availability for all Servers and Applications
- Use automation (Jenkins/Puppet/Chef/), Bash, Ruby and Python Scripting to provision, deploy and build multiple web applications
- Build Enterprise Monitoring Tools(Nagios/Zabbix) to monitor availability and optimizing the performance for Web Applications as well as Operating Systems (Memory, CPU, disk)
- Collaborates with other teams(Database/Security/Network) in building and maintaining hosted applications
- Maintain Cloud environment by performing continues patching, upgrading OS Kernel, Remediating security vulnerabilities.
- Execute Ad hoc SQL Queries, and make updates to databases if needed, working in collaboration with Database Team
- Perform log management with Splunk and Global systems configuration change with Chef/Puppet
- Maintaining system security policy, including F5(load balancers), host and client access, file permissions and user accounts
- Create/Modify technical process documentation using Confluence and Jira
- Act as an on-call technical resource as needed during off business hours
- Responsible for local inventory maintenance and software license management.
- Monitors security compliance
- Create skills requirements for new roles as well as interview for new candidates
- Provides training and mentor new Systems Administrators or Cloud Engineers.
- Conduct Weekly status meetings for systems administrators or Cloud Engineers
- Design and build work books for key applications usage
- Research for third party application tools as well as interview vendors for their products through demos
- Set up maintenance plan and communicate with Global Clients
Confidential
Senior Cloud and Linux Engineer
Environment: Cloud Infrastructure(Terremark/Verizon), CGI Cloud, VMware(ESXi4.1) Redhat 5 and above, Microsoft Windows Server 2008, Apache, Tomcat, MySQL, PHP, Oracle, Phone Factor, Active Directory, LDAP, Nagios, Drupal, Percussion, Mediawiki, Sendmail, SMTP(Postfix), Jira, Confluence, DNS,NFS,NAS, Wordpress, loggerNet, Akamai
Responsibilities:
- Operations and Maintenance of all websites hosted in GSA-OCSIT CGI cloud infrastructure. Few examples were usa.gov, business.usa.gov, kids.gov, howto.gov, data.gov, itdashboard.gov and performance.gov
- Designed, deployed, configured, and maintained Microsoft Windows/Linux servers in the cloud environment
- Installed, configured, and tracked Operating System patches
- Installed, configured, and tracked application patches to ensure availability and integrity of the applications/services they provide
- Deployed, configured, and maintained Percussion (Rhythmyx/CM1) Content Management servers
- Designed, Deployed, configured, and maintained Apache, Drupal base webservers
- Designed, Deployed, configured, and maintained Tomcat Application servers
- Designed, Deployed, configured, and maintained MySQL/Oracle Database servers
- Reviewed Nessus Scan Report for vulnerabilities, research and mitigate affected systems
- Created scripts to sync web content from origin to Akamai Edge Suite
- Created scripts to monitor server performance, nightly backups as well as cron jobs for other automated activities
- Updated DNS entries (SPF, MX and A records) for applications sending outgoing emails
- Managed Netstorage and redirects rules using Akamai Edge Suite or Luna Control Center
- Maintained phone-based Two Factor Strong Authentication system for content managers
- Troubleshoot and resolved complex IT infrastructure and application issues
- Provided training and documentation for new systems administrators
- Set up a Mini Lab to host NREL Solar Decathlon scoring engine. The Lab was made up of HP Proliant DL 380 servers running VMware ESXi server 4.1, hosting 5Virtual Machines with guest OS as Redhat, CentOS, and Microsoft Windows Server 2003. The Network Infrastructure included Cisco Switches connected to data loggers collecting real-time data. Then provided remote access for Solar Decathlon organizers and developers for testing and validating configuration.
- Built and hardened virtual machines for the NREL external hosting facility
- Designed, developed and implemented LDAP for central user accounts management
- Designed, developed and implemented Nagios to monitor all the major IT components
- Developed and implemented configuration management system(Subversion) for storing code, documents and versions of documents
- Analyzed OS and other system logs to look for security breaches and errors
- Performed disk expansion by adding virtual hard drive to existing Virtual Machines using Logical Volume Management(LVM)
- Applied updates to word press (Solar Decathlon blog)
- Used Akamai Edge Control Application to delivery web content, redirect and purge URLs.
- Maintained Oracle 10g database by applying critical patches, performing new oracle product installations, taking backups and restoring databases using tools like RMAN and Korn Scripts for weekly backups.
- Resolved major Oracle database application issues. Played a key role in the implementation of GSA new Change Management System supported by Oracle 10g database and JIRA/Confluence
- Implemented and maintained Apache web servers by setting up virtual hosts for multiple websites as well as writing redirect rules.
- Maintained SMTP servers making sure mails from the servers in the cloud are processed successfully.
- Co-ordinated with security team to fix vulnerabilities reports by Nessus Scan including turning off vulnerable services, forcing servers to use SSL and auditing user accounts.
Confidential
Senior Unix Engineer
Responsibilities:
- Performed installation and configuration of Sun Solaris Operating System on new sun servers T-Series (T5240) and M-Series (M4000/M3000).
- Installed Sun patch cluster on new and existing servers.
- Provided security administration to about 70 servers making sure they are in compliance with BLS security policies.
- Maintained Mail Transfer Agent (Postfix) on external DNS running Bind 9.6.1.
- Created a new Sender Policy Framework (SPF) for BLS
- Tracked email messages and spam filtering using Ironport M650 and C350.
- Processed and released quarantined email messages on BLS mail server.
- Maintained the overall Internal and External DNS (BIND 9.6.1) for BLS.
- Installed and maintained a single sign-on authentication system (SSH Tectia)
- Installed patch cluster to test, development and production servers every quarter in order to keep operating system up to date.
- Wrote Scripts to maintain log files, monitor server availability and backup data.
- Maintained Perl script that monitors the availability of DNS servers, Production servers and Netcool (Network Appliance)
- Monitored system and network performance and resolve issues.
- Maintained monthly audit report and documentation on all Oracle, Weblogic and DNS servers.
Confidential
Linux / Unix Systems Administrator
Responsibilities:
- Provided system administrative support for Sun Solaris/Red hat servers.
- Initiated the installation of the Solaris/Red hat Enterprise operating systems from both local and network based installation media sources.
- Configured servers to remote disk storage from Network Appliance SANs, Dell/EMC storage arrays and performed cluster administration
- Checked backup logs daily and corrected problems with backup software. Restored files from full and incremental backups.
- Installed patches to keep software up to date.
- Administered NFS and NTP on Sun and Redhat Linux servers
- Developed bash shell scripts to monitor disk usage, CPU usage, and identify run away processes
- Configured and implemented Solaris Containers, Zones, Virtualization and ZFS
- Harden Linux/Unix servers by turning off vulnerable services
- Identified, documented, tracked, controlled, and audited configuration items and baselines within the UNIX/LINUX NETWORK environment