Site Reliability Engineer Resume
SUMMARY:
- AWS certified
- 6 years experience with high scalable Confidential 's 500 websites, DevOps and SRE
- 10 years on Linux, Windows, Cisco
- Excellent troubleshooting and problem solving skills.
- Experience with both Open source and Proprietary applications
- Ability to learn quickly and adapt to changing priorities and requirements
- Startup incubation and Confidential 500 experiences in Silicon valley/Bay Area
- Frontend and backend experience
- Experience building and publishing iOs apps for iphone/ipad and Confidential Watch
TECHNICAL SKILLS:
AWS, Linux, Apache, Git, shell, Vagrant, Docker, Puppet,Chef, Git, OpenStack, Runbooks,Route53, dynatrace, akamai, Netscaler, HAProxy, Apache Traffic Server, Spark, Splunk,, Kibana, ElasticSearch, Logstash, Apache Nutch/Solr, Hadoop Ambari, Nagios/Cacti/zabbix, EC2, Service now, Jive, Linux, sumo,datadog JAVA,HTML5/CSS3, MySQL, PHP Codeigniter, Zend(Magento),Linux, Apache,IIS, PHP, XML,,JavaScript, JSON,XML, EDI/API, NoSQL,RUM,PageSpeed, CDN,YSLOW, pingdom, webpagetest,caching, varnish, memcached, apc,nagios, cacti, monarch, zabbix, new relic, splunk, Page Speed, Unix
WORK EXPERIENCE:
Site Reliability Engineer
Confidential
Responsibilities:
- AWS certifications
- 3 - 5+ years of experience working with AWS- must know AWS command line and be able to upload files and create policies
- Needs to have AWS cloudformation experience and be able to create a stack with ELB, EC2 and RDS
- Must have experience working with GIT- must understand push, pull, resync branch, and pull request workflow
- Must have basic programming experience- basic data structures: array, hash(associative array)/ looping and conditionals
- Jenkins experience- should be able to create a freestyle job
- Docker experience - be able to pull and run a container from dockerhub and ECR
- Experience working with Puppet- know how to troubleshoot simple failures (manifest error, missing dependency, failed node classifier)
Environment: Linux, AWS,Puppet, Maven, Chef, Git, java tuning, Java Garbage Collection(old, young generation), Java Heap, Docker,Jenkins, Sumo, Datadog, Document
Site Reliability Engineer
Confidential
Responsibilities:
- 3+ years of experience with Windows, supporting/troubleshooting in a production support environment.
- Experience with .NET 3.5 and 4.x applications on IIS 6/7.x.
- 3+ years of Networking experience, specifically strong knowledge of TCP/IP.
- Experience with Load Balancers, specifically Net Scalar.
- Experience with Clustering.
- Experience with VMware (configuration is preferred).
- Experience with monitoring/content delivery tools such as Foglight or Akamani.
- Experience establishing process exposure
Environment: hadoop, ElasticSearch, Kibana, logstach, splunk, WEB, Java based web-app deployment, nagios, support, linux, Apache Tomcat, WAR, Git, vagrant, deployment, jenkins, Docker, TCP, on-call,AutomationContinuous integration(Git and co)+Continuous Delivery (jenkins and co)
Site Reliability Engineer/DevOps
Confidential
Responsibilities:
- Implementing and maintaining monitoring systems
- Facilitate the needs of dependent teams (engineering, QA, operations) and work with external teams to achieve results.
- Manage QA and production deployments for many applications.
- Document new procedures and modifications to environments.
- Support production and non-production application environments.
- Strong knowledge of Linux and a good understanding of networks (TCP/IP fundamentals, firewalls, routing).
- Experience troubleshooting problems and working with cross-functional teams for resolution.
- Familiar and comfortable with the following: Apache, tomcat, Java, Subversion/GIT, truss/strace, network analysis tools (e.g. tcpdump/Wireshark).
- Understanding of load balancers and Layer 7 traffic routing via NetScalers or equivalent.
- 3+ years in a Systems Engineering/DevOps role.
- Familiarity with Hadoop.
- Experience with Nagios monitoring.
- Fluent in at least one scripting language in addition to Bash (Python/Perl/PHP/Ruby), Python experience highly desired.
- Desired above all else: aptitude, enthusiasm, and thirst for knowledge/digesting new technologies.
- Host Procurement
- Tomcat Setup
- Netscaler Loadbalancer Configuration
- Firewall Request
- Deployment
- SSL Certification Request
- Splunk Setup Request
Site Reliability Engineer
Confidential
Responsibilities:
- Apache, Unix/Linux, Web Applications, Managing Web Sites, Scripting Languages.
- Exposure to tool/product development
- Experience with high-volume websites (2 years)
- Strong knowledge of Unix/Linux, Apache, performance tuning concepts, and web applications
- Strong written & oral communication skills are essential. Proven ability to write bugs, test cases, problem reports
- Ability to rapidly learn and assimilate knowledge of complex software and systems, and apply understanding of system architecture when planning operational tasks and strategy
- Demonstrable experience in one or more languages such as: shell scripting, Perl, PHP, or Java or C is also a plus
- Experience with statistical analysis of defects and system performance a plus
- Strong knowledge of TCP/IP networking, SMTP, HTTP, load-balancers, highly available network servers
- Identify the priority and criticality of incoming alerts and prioritize appropriately
- Diagnose & repair issues using critical knowledge of Apache, UNIX processes, MySQL and related technologies within the OSI stack.
- Track issues through the ticketing systems and follow through to resolution
- Utilize monitoring tools to proactively identify issues and trends
- Write clear and concise operational runbooks
- Escalate significant issues to service, network or other operations engineers
- Lead by example, deliver results and eliminate missed opportunities
- Ideal candidate will possess a broad range of computer science skills. The candidate must be persistent, result oriented, and a self-starter.
Environment: Haproxy, Apache Traffic Server, hadoop, ElasticSearch, Kibana, logstach, splunk, Linux,aws, nagios, cacti, monarch, zabbix, new relic, splunk,jira, zendesk, flash, chef, ec2, git, deployment, AutomationContinuous integration(Git and co)+Continuous Delivery (jenkins and co)
Site Reliability Engineer
Confidential, San Jose, California
Responsibilities:
- 5 or more years of experience in Windows Server and Linux administration in an Internet-focused production environment.
- Red Hat Linux administration is a big plus.
- Experience supporting 24x7 mission critical Internet applications.
- ASP and online hosting experience a huge plus.
- Thorough understanding of networking concepts and Internet protocols.
- Familiarity with hosted application service provider environments, including remote administration of devices.
- General database experience, including aptitude writing SQL and administering Microsoft SQL Server and MySQL Ability to code in Perl and shell script (and learn other languages quickly).
- Familiarity with Python, bash shell, Jenkins build system, perforce, Unix basics (ps, ssh, xfs filesystem), and RAID a plus.
- Programming aptitude, particularly in web-related languages.
- Experience with JBoss and/or J2EE environments.
- Familiarity with a formal software release process, including the software build process and source control.
- Excellent communication and prioritization skills.
- Ability to learn quickly and adapt to changing priorities and requirements.
Environment: CMS, ERP, PHP, JS, HTML, linux, s3, rackspace, wordpress, magento, joomla, drupal, ecommerce,DOM, netsuite ERP, CRMSpeeding Up Loading Time, PageSpeed/SiteSpeed,DOM, CDN, velocity,YSLOW, pingdom, webpagetest, time to first byte, start to render, image/text compression, caching, varnish, memcached, apc, defer javascript, inline css, minify js/css, Web Page Optimization/Front-End Optimization, nagios, cacti, monarch, zabbix, new relic, splunk, Page Speed, ab, top, wordpress, magento