Infrastructure Architect / Sre Devops Architect Resume
Emeryville, CA
SUMMARY:
- Technical leader with over 15 years of large scale site and infrastructure management. Passionate about resilient and flexible automation around best practices and standardized processes, continuous improvement, management / monitoring, and deploying microservices to SPITaaS. Strengths include communication, collaboration, problem solving, strategic & tactical planning, as well as team building, organizational & process governance development, mentoring and managing technical staff.
TECHNICAL SKILLS:
SKILLS: Program Management: project planning, development, scheduling, coordination, resources, refresh, P migration, methodologies/strategies, QA/test, automation, deployment, capacity planning & performance Automation: shell(csh, bash, sh), awk, sed, tcl, perl, php, python, ruby(JSON), management/monitoring UNIX and its variants, Linux, Solaris, AIX, IRIX, HP - UX, SCO, Mandrake, SuSE, Red Hat(RHEL), Fedora, United, ubuntu, Debian, MacOS-X, CentOS, VMware GSX, ESX, VIC, NetApp OnTap, Citrix, Windows Server 2K3, 2K8, 2012, Powershell: Build, Configure, Install, Test, Deploy, CI/CD/CT Solution Stacks, Deployed Systems: UNIX/Linux, VMware, and Windows/Azure; Apps & Tools Architecture (TOGAF): App, Web, DB, MQ, Management Tiers, system spread, network backbone, Clustering, Redundancy, High Availability & Failover, RAID storage and Data Center topologies, Rack/Stack Web/App servers: Hydro, WebSphere, Apache Tomcat, Java, J2EE, JRE/JVM, JBOSS, IIS, middleware DevOps / Process BDM: Integration, Build & Release, Patching & Upgrades, Deploy, DR, Change Management, SRE, Incident Triage/RCA Management, Continuous Improvement, Incident Prevention Compliance & Risk Management: InfoSec, Encryption@REST, HIPAA(PHI/PII), PCI-DSS, SOX, DOD/FedRAMP, DR audits & PoC
PROFESSIONAL EXPERIENCE:
Confidential, Emeryville, CA
Infrastructure Architect / SRE DevOps Architect
Responsibilities:
- Put in place proactive, automated monitoring 24x7x5minutes with prescriptive action on Red threshold healthcheck actionable events, thoroughly documented and detailed of Ops groups on troubleshooting, triage, remediation, platform architecture, secure messaging flows in Wiki and Runbook have effectively prevented hundreds of production incidents, contributed to a large reduction in frequency of incidents that has been noticed throughout the organization and customer base.
- Maintained & mapped out RelayHealth Infrastructure & Network Architecture at Data Centers and migration roadmap to Azure Cloud, physical and VM server spreads, network backbones and solution stacks for multiple platforms including Clinical Health Systems, CommonWell Platform Services separating out for each environment, the Management Tier, Web Tier(Hydro Web services, API), App Tier with data message flows RabbitMQ services and message queues, Cloverleaf, DB Tier (SQL Server and MongoDB and overall secure messaging flows to DB updates)
- Completed architecture, network infrastructure, security, data flow, & DR compliance diagrams satisfactory for DOD audits, contributed architectural designs, IPAM & VLAN segregation layout & Data Center layout / breakdown which also contributed to a multi-year DOD contract renewal with RelayHealth’s largest & most strategic customer. FedRAMP audits & DOD compliance is more stringent than HIPAA on PHI / PII.
- Took over Systems Architecture, DevOps/SRE (24x7) monitoring, management and BDM deployment of RabbitMQ secure messaging App, ran 6AM platform readiness checks, RMQ proactive monitoring healthchecks(JSON/PS1) and eventual (24x7) full automation of all(14) RMQ clusters in all environments, maintenance, operational stability, documented procedures and operations(build, deploy, maintain) in highly detailed, thorough RMQ Runbook, 700+ queues(consumers & prefetch counts) and messaging flows, message throttling, and pushed TOI/ of RMQ and Hydro Messaging, Messaging Flows, integration with Cloverleaf on HL7 messaging throughout the organization including Dev, DCS, NOC, SOC, SRE and NTT outsourcing. Managed RabbitMQ Solution Stacks: RMQ3.3.x to 3.6.10, Erlang 1 .3.2, yum, puppet, nails, vormetric, Riverbed, SCOM, RMQ Mgmt. Console, and VMware Solution Stack. Completed first successful RMQ DR PoC in cloud environment using suboptimal VM clones. Site Reliability team in the creation of a playbook to address and triage any production issues should they arise. Developed proactive monitoring telemetry with prescription action for Red and Yellow thresholds that were actionable. Utilized Agile deployment methodology.
- Operations rapidly evolved into SRE team of elite SMEs from various skillsets to tackle numerous platform issues effecting operational stability and solving numerous Production Incidents. Provided guidance and analytical expertise to engineering staff and leaders up to VPs.
- On numerous Production Incidents served as Incident Manager, OnCall Manager, Communications Manager roles, focusing on customer impact and immediate remediation, RMQ SME and managed RCAs, intermediate term prevention, longer-term remediation and incident action items, gap analysis, especially in management/monitoring using Jira and Kanban boards, services, message flows, RMQ Runbooks on Atlassian Confluence WIKI. Multiple daily production incidents reduced frequency to customer impactful 911/811 incidents of between 55-60 days as RMQ, Operational Stability and more proactive automated monitoring was put in place.
- Monitored availability and performance of RMQ and overall Relay Health production, integration, demo (customer facing environments) products, IAM, tools, monitoring, systems, VMs, datacenters and cloud migrations. Worked closely with internal(NOC, SOC, DCS, SRE, NTT, Dev) and external vendor teams to identify, mitigate, and drive a resolution of issues related to clinical platforms, network, systems, and applications. Provided technical, monitoring, diagram walkthrus and tools to increase our service level and improve our responsiveness.
- Current state assessments, investigated, Aggregated, Identified, and prioritized Enterprise/Infrastructure and Operational Weaknesses for resolution, especially in areas of Technical Debt, Technology Refresh, Architecture opportunities for improvement, Operational Stability. Best Practice Process and Governance improvements, and SLA / KPI Vulnerability
Confidential, San Francisco, CA
Technical Project Mgr / Technical Consultant / IT Solution Architect
Responsibilities:
- Implementation Consulting, Professional Services, Installation, Configuration, Setup, Testing, Performance Measurement & Optimization, Qualification, Customer Support & Technical Documentation, Data Migration/Ingestion, Operations/Supervisor Management & Monitoring for multiple HPE services clients including CSAA, Confidential, McKesson, Wynn Int’l, Amgen, Shell Oil, SCAN Health, Confidential, CitiBank, JPMorgan, Bank of Nova Scotia and Merck.
- Provide detailed customer/client documentation Runbook / Playbook for HPE Autonomy Professional Services project implementation, configuration, maintenance, solution design through post-implementation support, research and evaluate customer technology to assure HP product compatibility, right-sizing, performance and operability.
- DB2 to Oracle(10g, 11g) SQL Conversion, Data and Database Migration; New Solution/Product Proposals: Enterprise IDOL DataNOC, DB2 and SQL Server data migrations to Oracle for healthcare clients Merck and Schering-Plough
- HP/Autonomy Portfolio Products: Access Anywhere (AAA), Consolidated Archive System(ACA), Process Automation System(APA), Records Management System(ARM), Collaborative Classifier(ACC), Control Point(CP), Digital Safe(DS & DSMail), Enterprise Archive System(EAS), Early Case Assessment(ECA), Worksite, Intelligent Data Operating Layer(IDOL), Fast Search & Transfer(FAST), IDOL Universal Search(IUS), Introspect (i6), Legal Hold(ALH), Supervisor(S6), connectors, tools and all built on IDOL context-sensitive, enterprise class search engine.
- VM-based solution lab sysadmin/setup of multiple customer PoC and test/evaluation domains, maintenance & administration including Linux(RHEL, CentOS) and Windows Server 2K3/2K8 servers. DevOps support of HPE solutions deployed.
- Massive Data aggregation of technical, product/solution implementation information, ITIL, tools, client projects, Runbooks & documentation into shared TMO Worksite è TMO Sharepoint + Professional Services WIKI (Confluence) + TechHelp WIKI (MediaWIKI) + JIRA
Confidential, San Francisco, CA
Infrastructure / Systems Architect
Responsibilities:
- Infrastructure architecture designs, capacity planning, design governance, technology and vendor evaluation, technology roadmap planning to meet the needs of various banking platforms serving millions of customers‘ and business‘ online banking needs. Solutions to meet banking level security architectural requirements, access authorization/authentication(IAM), network & security infrastructure(IPAM) logging, Mgmt.
- Developed & maintained Standard Server Solution/Technology Stacks: Web(Sun One, Apache, analytics), App(WebLogic, tc server, Introscope), DB(SQL, Teradata, Oracle 10g/11g, RAC, replication), Batch(Autosys, cron), Messaging(MQ, Active MQ), ADC(Network, firewalls, F5, DataPower), and tiered Storage (NAS, SAN, RAID 1,5,6,10,15,50, ZFS, Veritas, LVM, backup)
- SLAs (>FourNines achieved), IPv6, standardization to improve efficiencies, leverage best practices, and reduce costs.
- Architect and diagram transaction messaging flows, infrastructure hardware, Bills of Materials/Pricing/Quotes, connectivity diagrams, logical deployment, design specifications, system deployment, storage & backup, HA & BCP for a multi-tiered, multiple DC deployment environments, detailed diagram and documentation of architecture/design using Sparx Enterprise Architect & Visio software.
- Collaborate with security, development, business units, operations, configuration management, audit teams to design, test, develop, deploy, and deliver highly available solutions which meet financial institution security requirements using broad knowledge/expertise in infrastructure: servers, OS, applications, web, database, batch, messaging, application delivery controllers, and storage for multiple data centers encompassing 80K servers from Dev, PIT, SIT, UAT, QA, PTE, Pre-Production to Staging & Production(Active/Active/Active) environments. Pragmatic use of virtualization, P2V, software/hardware upgrades and standardized technologies on Windows, RHEL, and Solaris platforms.
- Release, build, deployment & delivery management, product/app software & platform upgrades & maintenance, technology refresh, scalability and capacity planning growth on Windows Server, RHEL, and Solaris platforms.
- High use of Solaris Zones, Containers, Physical & Logical Domains, and VMs for virtualized environments and internal private clouds.
- Architecture design of banking solutions for Internet Services Group(ISG), infrastructure maintenance including software, hardware, security upgrades and technology refresh, management/monitoring infrastructure as well as capacity planning/growth on programs including WF main/menu page and SEO, Online Sales and Marketing Program(OSMP) including lending document processing for mortgage and loan banking, HE and HELOC, Online Application Status(OAS), Customer Needs Assessment(CNA), Business Online Banking(BOB), BOB Desktop Deposit, BOB BillPay, banking analytics monitoring DB (Oracle 11g accessing multiple datastores including Teradata), Content Management System replacement for legacy Documentum system
- Implemented bank-level security initiatives including SSO, SIMS, SAML, Channel Secure, 1024-bit to 2048-bit encryption migration of SSL/digital certs used on all Internet Architecture & Engineering(IAE) platforms/servers including “mutual authentication” when dealing with 3rd party CA trusted sites, use of Tealeaf and F5 ASM trace routing on all external facing IAE systems.
Confidential, Holliston, MA
Senior Manager, Infrastructure Architect, IT IS Consultant
Responsibilities:
- Evaluation/assessment of client IT infrastructures, identification of pain points, process/operational problems to cure pain points & service deficiencies. Working with multiple location teams, CoEs, and clients around the globe. Proposed recommendations and game plans to refresh technology, improve HA/BCP, resiliency, greener solutions, tiered architectures and moving toward modern, cost effective and efficient, virtualized and cloud infrastructures with improved governance, ITIL/ITSM, migration plans.
- Multiple Data Center, HPC, P&V, and Cloud Computing strategies, pragmatic virtualization, tiered infrastructures, database, data warehousing, aggregation, consolidation, optimization, improved efficiencies and service levels, utilization & effectiveness of infrastructure, availability, performance, scalability, capacity planning, system resilience, and services/operational capabilities, process/governance. Implementing SuperClouds, private internal clouds, and enterprise/business Apps & solutions, the SPITaaS. Improvements in Operational Stability, Performance/Response Times, Process, & Automation
- Assess, Design and architect, cost-effective, optimized, highly available infrastructure and solutions for client IT IS needs
- RFP solution proposals to optimize apps, DEV, TEST/QA, PROD & DR compute environments to improve scalability, efficiency, & performance, applications/client solution infrastructure, high availability, DR capability, SLAs while addressing security, privacy, & compliance requirements such as SOX, SAS70, PCIv3 DSS, HIPAA, FINRA
- IT IS Ops, Delivery, Technology Refresh, Data Center design, consolidation, transformation, migration, optimization, P2P lift & shift, P2V, V2V, VM virtualization, improved cost effectiveness, Green IT, scalability and performance, management & monitoring, reduce infrastructure footprint, managed hosting, grid and cloud computing environments, IT IS services, delivery and deployment, upgrade, maintenance and support on Windows Server, Linux/UNIX Data Center environments
- Managed Confidential ’s pipeline of IT IS infrastructure deals with multiple CTS clients and strategic partners in the US, Asia, & Europe: Working with numerous offshore & onshore application teams and onshore hosting partners to support collocation of infrastructures, managed hosting for Production, Dev, UATest/QA, and DR environments with dedicated servers and both open/dedicated cloud computing capabilities. Apps included custom, FACETS, SAP, Oracle, SQL Server, WebSphere, WebLogic, ATG, portal, MOSS, VMware, & infrastructure/middleware software. Developed Service Catalogue Descriptions, Pricing Models, DC Environmental Power Consumption Tools and applied them in client projects. Hosting monitoring & management with CTS OnTarget and HP OpenView
- Major Medical/Healthcare Group: Mercy Hospital, Philadelphia, PA: Map existing infrastructure and identified problems with data flow and operations, made re-architecture and transition recommendations for data center (DC) infrastructure and transactional throughput for FACETS application on HP Blades with Sybase replication.