Etl Developer Resume
Plano, TX
SUMMARY
- Five plus (5+) years of IT experience in the Analysis, Design, Development, Testing and Implementation of business application systems for Banking, Confidential, Pharmaceutical, Financial, Telecom and Manufacturing Sectors.
- Worked on DataStage tools like DataStage Designer, DataStage Director and DataStage Administrator.
- Strong understanding of the principles of Data Warehousing using fact tables, dimension tables and star/snowflake schema modeling.
- Strong experience in the Analysis, design, development, testing and Implementation of Business Intelligence solutions using Data Warehouse/Data Mart Design, ETL, OLAP, OLTP, BI, Client/Server applications.
- Strong Data Warehousing ETL experience of using Informatica 9.1/8.6.1/8.5/8.1/7.1 PowerCenter Client tools - Mapping Designer, Repository manager, Workflow Manager/Monitor and Server tools, Informatica Server, Repository Server manager.
- Worked extensively with Dimensional modeling, Data migration, Data cleansing, ETL Processes for data warehouses.
- Extensive testing ETL experience using Informatica 9.1/8.6.1/8.58.1/7.1/6.2/5.1 (Power Center/ Power Mart) (Designer, Workflow Manager, Workflow Monitor and Server Manager) Confidential and Business Objects.
- Developed parallel jobs using different processing stages like Transformer, Aggregator, Lookup, Join, Sort, Copy, Merge, Funnel, CDC, Change Apply and Filter.
- Used Enterprise Edition/Parallel stages like Datasets, Change Data Capture, Row Generator and many other stages in accomplishing the ETL Coding
- Extensive ETL tool experience using Confidential Infosphere/Websphere DataStage, Ascential DataStage.
- Familiar in using highly scalable parallel processing infrastructure using parallel jobs and multiple node configuration files.
- Experienced in scheduling Sequence and parallel jobs using DataStage Director, UNIX scripts and scheduling tools (Autosys).
- Experience in troubleshooting of jobs and addressing production issues like data issues, ENV issues, performance tuning and enhancements.
- Extensive experience in design and development of Decision Support Systems (DSS).
- Assisted in development efforts for Data marts and Reporting.
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing (UAT) and Performance Testing.
- Worked with various databases like Confidential 10g/9i/8i, DB2, SQL Server, Confidential .
- Extensive experience in writing UNIX shell scripts and automation of the ETL processes using UNIX shell scripting.
TECHNICAL SKILLS
ETL Tools: Confidential Infosphere DataStage 8.5, Confidential Infosphere DataStage 8.1 (Parallel & Server), Confidential Websphere DataStage 8.0.1 (Designer, Director, Administrator), Ascential DataStage 7.5.2 (Designer, Director, Administrator, Manager), Informatica
Database: Confidential 10g/9i/8i, Confidential DB2/UDB, Confidential, SQL Server 2003/2005/2008.
Data Warehousing: Star & Snow-Flake schema Modeling, Fact and Dimensions, Physical and Logical Data Modeling, Erwin, Cognos
Operating systems: Windows 7x/NT/XP, UNIX, LINUX, Solaris, MS-DOS,MS Access
Languages/Scripting: C, C++, Java, D2K, Visual Basic, PL/SQL, UNIX Shell scripts
Testing/Defect Tracking: Confidential Quality Center, Test Director, Bugzilla
PROFESSIONAL EXPERIENCE
ETL DeveloperConfidential, Plano, TX
Responsibilities:
- The project involves integration of different services provided by the client into one data store to perform cross-sell analysis.
- Involved in the creation of dimensional schema including dimension tables, fact tables, aggregate tables and target tables ( Confidential ).
- Involved in all phase of SDLC, Create details Analysis - design document with source to target mappings.
- Developed and maintained accurate project documentation and date model diagrams to provide management with proper understanding of organization needs.
- Prepared technical data flow proposals for enhancements and integration of existing third - party data. Communicated with business users and project management to get business requirement and translate to ELT / ELT specifications.
- Developed MapReduce programs to parse the raw data, populate staging tables and store the refined data in partition tables in the EDW.
- Involved in the development of jib sequencing using DataStage Sequencer.
- Managed Repository Metadata from DataStage Manager
- Created Hive queriesthat helped market analysis spot emerging trends by comparing fresh data with EDW reference tables and historical metrics
- Created ETL Confidential data stage parallel jobs to extract and reformat the source data so it can be loaded into new data warehouse schema.
- Developed Datastage jobs to do ETL transformations with requirement provided and load respective dimensions and Fact tables.
- Used SQL for data querying and database administration with Confidential database.
- Worked on migrating data stage jobs from infoSphere information server, Version 9.1 to infoSphere information server, Version 11.5.
- Conducted unit tests using various test cases and also conducted QC testing.
- Provide technical support to both business team and user departments for all projects.
- Production implementation and Post-Production Support
Environment: Confidential WebSphere DataStage 8.0.1, Confidential AIX 5.2, Confidential 10g,DMXpress tool, XML files, Autosys, MS SQL Server database, sequential flat files, TOAD, Confidential
ETL DeveloperConfidential, Addison, TX
Responsibilities:
- Responsible for Business Analysis and Requirements Collection.
- Worked on Informatica Power Center tools- Designer, Repository Manager, Workflow Manager, and Workflow Monitor.
- Understanding the business requirements and future roadmaps - multiple discussion with different business stakeholders( specially solution architects, business analysts/ domain SME’s)
- Assessed different source systems that will feed the Data Marts or centralized Enterprise Data Warehouse.
- Wrote 10+ BTEQ scripts to handle Change Data Capture logic and conducted data validation utilizing ETL and prepared test reports for loading data into the target tables ( Confidential tables).
- Wrote HQL scripts for loading data into the Hive tables and scheduling the jobs using AutoSys.
- Created (Extract, Transform and Load) ETL design mapping sheet, data reconciliation strategy - ETL framework, stored procedures and built SQL query objects to detect data loss.
- Maintained stored definitions, transformation rules and targets definitions using Informatica repository Manager.
- Used various transformations like Filter, Router, Expression, Lookup (connected and unconnected), Aggregator, Sequence Generator, Update Strategy, Joiner, Normalizer, Sorter and Union to develop robust mappings in the Informatica Designer.
- Designed and developed Informatica Mappings and Sessions based on business user requirements and business rules to load data from source flat files and Confidential tables to target tables.
- Review existing code, lead efforts to tweak and tune the performance of existing Informatica processes
- Loaded the flat files data using Informatica to the staging area.
- Created UNIX shell scripts for Informatica ETL tool to automate sessions.
- Maintained stored definitions, transformation rules and targets definitions using Informatica repository Manager.
- Used various transformations like Filter, Expression, Sequence Generator, Update Strategy, Joiner, Stored Procedure, and Union to develop robust mappings in the Informatica Designer.
Environment: Informatica Power Center 8.6.1, Workflow Manager, Workflow Monitor, Informatica Power Connect / Power Exchange, Data Analyzer 8.1, PL/SQL, Confidential 10g/9i, Autosys, SQL Server 2005, Sybase, UNIX AIX, Toad 9.0, Cogno, Confidential, HIve
Confidential
Application Production support
Responsibilities:
- Responsible for gathering troubleshooting issues and summarizing them into a weekly report for management.
- Designed and customized data models for Data warehouse supporting data from multiple sources on real time. Involved in building the ETL architecture and Source to Target mapping to load data into Data warehouse.
- Assisted in analyzing incoming equipment and developing the necessary control applications in Linux and UNIX.
- Responsible for all activities related to the development, implementation, administration and support of ETL processes for large scale data warehouses using Informatica Power Center.
- Wrote shell scripts for automating tasks such as for invoking applications (NDM, ftp) for file transmissions, monitoring system activity for resource contentions, system sanity checks, etc
- Provide production database support, enhance existing system and design and develop new systems.