ETL Architect/Lead Resume
Hoffman Estates, IL
SUMMARY:
- ETL/Teradata/Data Architect, Lead, & Developer with 18+ years of US IT experience
- Analyze, Architect and Apply quality process framework for ETL/Data Integration solutions
- Define, Design and Develop Quality Data Transformation, ETL/ELT and Integration Solutions
- Provide End-to-End Data Warehousing/ETL and Integration solutions using complete Agile/Six Sigma/SDLC processes or methodologies.
- Has strong experience in Data Warehouse/BI Data Architecture, Modelling and Implementations
- Has strong experience in providing complete End-to-End Unix/Linux-Teradata-ETL or Unix/Linux-Datastage-Teradata architecture, design and build solutions for BI Data warehouse projects. Good working knowledge of Teradata Unified Data Architecture (UDA) framework (Teradata Aster SQL, SQL-MapReduce)
- Has strong experience in providing complete End-to-End Unix/Linux-Datastage-Netezza or Unix/Linux-Netezza-ETL architecture, design and build solutions for BI Data warehouse projects
- Has strong experience in providing complete End-to-End Unix/Linux-Datastage-Oracle or Unix/Linux-Oracle-ETL architecture, design and build solutions for BI Data warehouse projects.
- Has strong experience working with SQL Server, Informix DB and DB2/AS400 databases in BI Space.
PROFESSIONAL EXPERIENCE:
Confidential, Hoffman Estates, IL
ETL Architect/Lead
Responsibilities:
- Design, Architect and build end-to-end ETL/Integration solutions to extract data from multiple sources into staging, transform and load.
- Interface and work with Business teams to analyze, gather and document ETL/Integration requirements
- Analyze, conduct and review source to target requirement for each source
- Work with multiple source teams to identify source formats, configurations and access issues.
- Translate ETL/Integration requirements into design flows and STMs.
- Design, Architect and build ETL/Integration processes using ETL/Integration tools or scripts.
- Design and Build Shell scripts to configure and automate the process
- Design and build ETL/Integration processes to load data into multiple targets (Teradata, Hadoop, MongoDB)
- Design, Architect and build data models for data flow from source to staging, landing and final targets.
- Build logical and physical data models for ETL/Integration processes
- Design, build and optimize simple to complex SQL queries for ETL/Integration process
- Configure, parameterize and productionize ETL/Integration and Shell scripts
- Design and build ETL/Integration optimization processes for the data flows
- Build Data automation processes for ETL/Integration flows
- Design and build audit balance scripts for validation prior to final loading
- Has experience integrating multiple sources into Hadoop HDFS. Good knowledge of file cleansing and aggregation using MapReduce/Pig processes on Hadoop
- Mentor, update and work with team on design and build strategies for ETL/Integration process
- Performance-tune Teradata queries for optimal processing, minimizing CPU and Impact CPU cycles.
Environment: Teradata V14.x, Unix/Linux, SQL Assistant, BTEQ, DB2, SQL Server, MongoDB, ETL/Integration Tool (IBM Infosphere/Datastage 8.1), MicroStrategy, Hadoop, HDFS, Pig and Hive/Impala, Power Designer, Control-M
Confidential, Chicago, IL
ETL Architect/Lead (Datastage & Teradata)
Responsibilities:
- Understand & Perform Source Data Analysis (ODS & Apartments Master DB)
- Understand, Articulate and Document the current Apartments. Confidential Data Flows, Business Model and Conceptual Data Model.
- Understand, Architect and Assist in developing a logical data model for apartments. Confidential BI Solution.
- Develop a Snowflake/Hierarchical Data model for the Apartments BI Solution enforcing Natural Keys as well as Surrogate Keys where needed.
- Assist in Architecting/Modeling Dimensions (Type 1, Type 2/Slowly Changing Dimensions), Hierarchies, Facts, Bridge Tables, and Groups involving Set Keys to handle one-to-many relationships
- Assist in identifying primary, join and secondary indexes for the Teradata tables.
- Understand and Implement Teradata Data Distributions utilizing Primary Indexes
- Developed and Designed ETL Process flows for current and new functionality.
- Architected, Designed and Developed an ETL solution utilizing an Extract, Stage, PostStage and Sync model for ETL jobs.
- Document, Design and Establish ETL Job templates for Extracts, Stage, PostStage and Sync Datastage Jobs.
- Document Datastage, Teradata & ETL Naming standards, guidelines and best practices for both Datastage and Teradata jobs and SQL scripts.
- Design and Implement DataStage and Teradata Jobs utilizing TPT/Teradata connector stage in Immediate and Bulk modes.
- Design and Implement Teradata BTEQ scripts utilizing Mass Insert, Update and Merge Statements for better performance.
- Design and Implement FastExport, FastLoad, MLoad scripts for larger loads.
- Develop UC-4 schedule designs and dependencies.
Environment: Teradata 14.0, SQL Server, Flat Files, XML, Putty, WinScp, Teradata SQL Assistant, AIX/Unix, SQL, PL/SQL, IBM Information Server 8.7 (Datastage & Quality Stage 8.7), Cognos/TM1
Confidential, Hoffman Estates, IL
ETL Architect/Lead
Responsibilities:
- Performed an impact analysis of ETL scripts, tables and databases that were to be part of the migration
- Developed and implemented an ETL design pattern to migrate existing Netezza scripts to Teradata/BTEQ scripts.
- Redesigned scripts and SQL queries to fit the needs of the Teradata platform for better performance.
- Developed and implemented serial vs. asynchronous design for ETL events based on source data availability
- Developed and Designed Process flows for current and new functionality.
Environment: Netezza, Teradata, SQL Server, Informix DB, Flat Files, XML, Putty, WinScp, Netezza Work Bench, Teradata SQL Assistant, AIX/Unix, SQL, PL/SQL, MicroStrategy 9.x, IBM Information Server 8.1.x (Datastage & Quality Stage 8.1)
Confidential, Shelton, CT
Lead ETL/Datastage Consultant
Responsibilities:
- Involved in gathering/reviewing business requirements from end users. Translate business requirements into technical requirements for S2M processes.
- Define and Design process flows/source-to-target mappings
- Architect, design and build ETL processes for S2M reporting using DataStage
- Work with Cognos/TM1 team in developing custom ETL processes for reporting
- Involved in Data/Model Architecture for S2M reporting including various Staging, Landing and Data-Mart objects (Tables, Views, Procedures)
- Architect, Design and Implement enhancements to the S2M data model
- Provide Test strategies and approach for S2M ETL processes/reporting
- Coordinate and work with project team in delivering the ETL tasks
- Provide estimates for Data Architecture, ETL/Integration solutions for S2M
- Provide/Implement Datastage Installation, configuration and support for S2M Solutions
Environment: IBM Information Server 8.7.x (Datastage & Quality Stage 8.7), SQL Server, Flat Files, Putty, WinScp, AIX/Unix, SQL, PL/SQL, Oracle 11i, Cognos/TM1
Confidential, Hoffman Estates, IL
ETL Datastage/Teradata Architect
Responsibilities:
- Facilitate and participate in gathering business requirements from end users.
- Provide Estimates, data volumes, usage and trends for capacity planning needs.
- Lead and coordinate data analysis tasks, translate high-level business requirements into technical requirements and associated business rules, and document those in Source-to-Target Mappings.
- Lead, Architect and Design Conceptual/Business and Logical/Physical Data Models for Home Appliance and Home Improvement projects involving Organizational, Product and SOAR business Hierarchies.
- Architect and design ETL Solutions for HA and HI Data Marts.
- Lead, Architect and Design Complex Netezza Scripts and processes for Data Loads from Staging to Reporting to process Billions of Rows.
- Lead, Architect and design complex Datastage process for Extracting and Loading data into Staging Tables from multiple Sources.
- Design/implement Type 2/SCD Changes in the Model
- Worked with sources such as Teradata, DB2/UDB, Flat Files, Spreadsheets, Netezza, SQL Server.
- Worked with Shell Scripts, Control-M schedules for ETL processes.
Environment: IBM Information Server 8.1.x (Datastage & Quality Stage 8.1), DB2/UDB (z/OS), Netezza, Teradata, SQL Server, Flat Files, XML, XSD, SOA, Putty, WinScp, Netezza Work Bench, Teradata SQL Assistant/Toad, Mainframe, AIX/Unix, SQL, PL/SQL, MicroStrategy
Confidential, San Ramon, CA
Datastage/Teradata Technical Architect
Responsibilities:
- Defined a process approach for the technical analysis needed
- Worked with Gap Leads and technical team to identify candidate processes for analysis.
- Identified and documented process gaps
- Documented and provided guidance on using Balanced optimization vs. TPT for Teradata
- Provided recommendations on using new Datastage 8.5 options and features with the migration
- Prepared initial/final analysis document for review with Teradata team and Confidential.
Confidential, Dallas, TX
ETL/Datastage Architect - Lead
Responsibilities:
- Involved in Discovery, Pre-Analysis, Estimation and Pre-Bidding process for the project
- Involved in Requirement Gathering, Source System Identification, and Analysis. Architected and designed Data Analysis and Data Profiling tasks and processes for source data
- Architected, Designed and Implemented Logical and Physical data model for Quality Dashboard project including Staging, Repository and Data Mart
- Implemented Star Schema Model for Facts and dimensions, and some Snowflake Structures where Hierarchies, Bridge tables and others are needed.
- Architected, Designed, Led and Implemented End-to-End ETL Processes for this project using IBM Information Server, Linux, Oracle, Flat Files, CSV Files and other sources
- Prepare and document DW, ETL, Datastage and Best Practices for the project
Environment: IBM Information Server 8.1.x/8.5, Linux/Unix, Oracle 10/11, FTP, WinScp, Putty, SQL Developer, TOAD, IBM Data Architect, Cognos 8
Confidential, Hoffman Estates, IL
ETL/Netezza/Teradata Architect
Responsibilities:
- Set up, facilitate and participate in gathering business requirements from client users.
- Provide estimates, build project plans and timelines, and report progress of the project throughout the SDLC. Gather data volumes, usage and trends for capacity planning needs.
- Perform source system analysis, Translate high level business requirements into technical requirements, associated Business rules and document
- Lead, Design and Implement Logical/Physical Data Model for Store Gaming Data Marts
- Architect and design ETL Solutions for the Sears and Kmart Staging, Data Marts.
- Architect and Design Complex Netezza Scripts for Data Loads from Staging to Reporting to process Billions of Rows. Architect and design complex Datastage process for Extracting and Loading data into Staging Tables from multiple Sources.
- Define and Implement Balanced Optimization for Datastage and Teradata Jobs. Designed/implemented Type 2/SCD Changes
Environment: IBM Information Server 8.1.x (Datastage & Quality Stage 8.1), DB2/UDB (z/OS), Netezza, Teradata, SQL Server, Flat Files, XML, XSD, SOA, Putty, WinScp, Netezza Work Bench, Teradata SQL Assistant/Toad, Mainframe, AIX/Unix, SQL, PL/SQL, MicroStrategy
Confidential, Chicago, IL
ETL/Datastage Lead
Responsibilities:
- Gather Business requirements for Inter Operational Reporting for both ETL and Reports.
- Gathered historical data volume metrics/trends and usage patterns for data mart and reports to account for future growth.
- Perform source system analysis, Data profiling and mining for Blue2 applications.
- Identify reporting Hierarchies, relationships, Entities, Attributes for new Data model.
- Identify and Design References, Dimensions and Facts for the Logical Data model from Source Data (Star Schema).
- Build Change Data Capture, Staging, Star Schema tables for the Data model
- Build and review Use Cases, Design Approach, Design Flows and UATs
- Analyze, Architect/Design and build Integration/ETL solutions for the above processes utilizing IBM Information Server 8.0.1 (Datastage and Quality Stage 8.0.1), DB2/UDB, MQ, Flat Files, XML, XSD, CLOB, DB2 XML Types. Extract information from DB2, Flat Files, XML Files, etc.
Environment: IBM Information Server 8.0.1 (Datastage & Quality Stage 8.0.1), DB2/UDB(Z/OS), MQ, Flat Files, XML, XSD, SOA, Java, J2EE, Putty, WinScp, Eclipse, DB Visualizer, DB2 Client Tools, IBM Data Studio, JCL, Mainframe, Rational Clear Case
Confidential, Chicago, IL
ETL/Integration Architect/Lead
Responsibilities:
- Baxter is a Health Care Product provider and Care company that applies a unique combination of expertise in medical devices, pharmaceuticals and biotechnology to create products that advance patient care worldwide. As part of its new business initiatives, it has launched a project to Integrate/ETL the following processes:
- Interactive Response and Triage (IRT) Adverse Event Notification updates to its current JDE AS/400, DB2 System databases
- Direct Access and IRT databases, between each other's systems.
Environment: IBM Information Server 8.0.1 (Datastage & Quality Stage 8.0.1), Oracle 10g, JDE/AS400(DB400), DB2, Flat Files, XML, XSD, SOA, Java, J2EE, C/C++, Putty, WinScp, Eclipse, Oracle SQL Navigator, TOAD
Confidential, Keene, NH
ETL/Integration Architect/Lead
Responsibilities:
- C&S Wholesale Confidential of Keene, NH is the second-largest food wholesaler company in the United States. The company distributes food to supermarkets, retail stores, and military bases across the country.
- Currently, C&S serves over 5,000 stores from over 70 locations in 12 states.
- C&S has initiated a project to enhance its current model of Supplier-Warehouse-Vendor to Direct to Store model (D2S), where it can establish and monitor a process for its sales Directly to Vendors from its Suppliers. This project captures moves and provides data analysis capabilities to C&S for better administration and service.
Environment: Datastage 8.1(IBM Information Server, Parallel, Server), Datastage Packs (XML, MQ Connector, MQ Stage, MQ Read, DTS), Java/JMS Queues/Plugins, AIX, TOAD, Putty, WinScp
Confidential, Peoria, IL
Sr. ETL Designer/Developer/Tech Lead - Consultant
Responsibilities:
- Worked on multiple reporting projects for Caterpillar MLDM such as NPI, CPPD, GL, GLS, GPP and LACD
Environment: Datastage 7.5/7.5.2/8.0 (Server, Parallel, IBM Information Server and OS 390), Datastage Packs (XML, MQ, SAP R/3 5.2.1, RTI), Unix/Linux, DB2/UDB, Oracle 8i/9i/10g/11g, Teradata V2R6 (BTEQ, Teradata SQL Assistant, FastExport, FastLoad, MultiLoad, TPump), ERWIN, XML, XSD, Mainframe OS/390, Cobol, JCL, VSAM, PDS/PS, ISPF, TOAD, Syncsort, Putty, WinScp