Etl Consultant Resume
Houston, TX
SUMMARY
- IBM Certified Solution Developer in InfoSphere DataStage 8.5 with 8years of experience in Client/Server business systems design/analysis/testing, Data Warehousing, Data Marts and Business Intelligence applications.
- 8 years of experience in working with ODS, Data Warehouse, Data Marts with Extraction Transformation and Loading process using IBM Information Server DataStage 11.3/9.1/8.5/7.5
- Designed Server Jobs, Job Sequencers, Batch Jobs and Parallel jobs.
- Over 2 years of experience in Installation, Up - gradation, Configuration and Administration of IBM InfoSphere Information Server IIS 11.3/9.1/8.1 Suite.
- 2 years of experience in Migration of DataStage and QualityStage Projects and Jobs from previous versions to 8.x Version of IIS InfoSphere Information Server suite.
- Worked with extraction and loading data from various database sources IBM DB2, Netezza,Oracle, SQL Server, Teradata, (Mainframe, Flat and XML) Files.
- Experience in developing and monitoring batch Jobs using UNIX and Shell scripts.
- Proven track record in troubleshooting DataStage jobs and addressing production issues like performance tuning and enhancement, testing and debugging.
- Involved in E-R modeling and Dimensional data modeling and analysis of Star schema and Snow flake schema.
- Extensive experience in Star schema, Snowflake Schema and 3NFs.
- Strong understanding of the principles of Data Warehouse using Fact Tables, Dimension Tables, Star Schema modeling, Ralph-Kimball and Bill-Inmon approaches.
- Proficiency in Data Warehousing techniques for Data Cleansing, Slowly Changing Dimension phenomenon, and Surrogate key assignment.
- Expertise in writing scripts using Teradata Extract and Load Utilities like: BTEQ, Teradata SQL Assistant, Tpump, MLoad, FastExport, and TPT.
- Experience in database and SQL programming for faster transformations and automation.
- Hands on experience in initial Production support and maintenance.
- Experience in performance tuning of ETL processes and DataStage jobs in SMP and MPP environments from both system as well as job design perspectives.
- Experience in scheduling jobs using various third party tools like CA Workstation,Control - M and ESP Scheduler.
- Created interface design document for jobs developed (detailed code description).
- Excellent analytical and functional skills with strong communication and interpersonal skills.
- Willing to learn and adapt to new challenges and technologies.
TECHNICAL SKILLS
ETL tools: IBM Information Server (DataStage & QualityStage) 11.3/9.1/8.5/ 7.5
Data Modeling: E-R Modeling, Star and Snowflake Schema Modeling
Databases: IBM DB2, Oracle 11g/10g/9i/8i, Netezza, SQL Server 2008/2005, Teradata V2R5.0
Languages: SQL, PL/SQL and Shell Scripting
Data Modeling Tools: ERwin, MS Visio
Operating System: AIX UNIX, Sun Solaris and Windows
Scheduling Tools: CA Workstation, Control - M, ESP Scheduler experience
PROFESSIONAL EXPERIENCE
Confidential, Houston, TX
Etl Consultant
Responsibilities:
- Worked closely with the project team in understanding end user requirements and provided technical feasibility.
- Analyzed Data Models with Business and System Analysts and participated in the preparations of the ETL Technical Design Document.
- Designed and developed ETL jobs using IBM DataStage 11.3 where Data Migration involved in input source DB2 database as well target systemOracleare transformed and enhanced.
- Used DataStage stages namely Sequential File, Datasets, Sort, Remove Duplicate Stage, Join, Lookup, Funnel, Transformer, Aggregator, Slowly Changing Dimensions, Peek and Row Generator stages in accomplishing DataStage coding.
- Developed Multi-Instance reusable DataStage jobs.
- Effectively implemented Partitioning and Parallelism techniques to fully utilize the resources and enhance job performance.
- Performed Peer review for DataStage code.
- Effectively used QualityStage for address standardization and matching process.
- Wrote SQL scripts to extract and load data from source and target databases.
- Extensively used DataStage Director for monitoring job logs in resolving issues.
- Worked on Job Control System in file transferring and in its maintenance.
- Documented Test Plan, Test Cases, Test Scripts and validations based on design specifications for Unit Testing, prepared test data for testing, error handling and analysis.
- Used CA Workstation for automation of DataStage processes in Dev, QA, INT, PVS and Rel and PROD environments.
- Involved in Production document design and supported Production for initial period.
Environment: IBM Information Server (DataStage, QualityStage)11.3, IBM DB2, Oracle,ERwin, Job Control System, CA WorkStation,Cognos
Confidential, Providence, RI
DataStage Consultant
Responsibilities:
- Participated in client meetings to review integrated data approaches and to interpret results.
- Extensively worked with Facts and Dimension tables to produce source - target mappings based upon business specifications.
- Participated in the development of high level and low level design documents.
- Worked on up gradation of DataStage from 7.5 to 8.5.
- Extensively worked with DataStage Designer to build jobs. The strategy in designing the jobs was to extract all the data, using transform rules, applying primary key strategy, foreign key strategy and finally loading into database.
- Used Environment Variables, Parameter Sets and Job Parameters for developing Parameter Driven Jobs and debugging them.
- Designed Job Sequences using Control jobs to streamline process flow implementing Date Adjustment/Restart Logic.
- Involved in performance tuning Parallel Jobs using Performance Statistics and Dump Score.
- Involved in Unit, System and Integration testing.
- Participated in meetings to know the updates of the business rules with client and with the team, and discussed to solve the data issues.
- Used Before and After Stage subroutines to further enhance performance of the system.
Environment: IBM WebSphere Information Server 8.5 (DataStage, Quality Stage)/IBM WebSphere DataStage 7.5.2, Oracle, Teradata, Control-M, MS Visio
Confidential, Bloomfield, CT
Etl Consultant
Responsibilities:
- Worked in Agile projects with Scrum teams and Product owners, and analyzed Business and Functional Requirements.
- Handled complex projects where input sources of Oracle and DB2 databases as well as flat files get transformed and enhanced using various ETL DataStage procedures.
- Staged in a DB2 database for delivery to downstream systems on the mainframe for further processing.
- Performed balancing and reporting using complex pivot stages and advanced transformer functions.
- Designed jobs using different parallel job stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Change Captured, Modify, and Aggregator.
- Used Debugging stages like Row generator, Column Generator and Peek.
- Demonstrated proficiency in working with the XML and MQ connectors to extract data and created XML files.
- Implemented slowly change dimensions (SCD) type 1, type 2, type 3.
- Implemented incremental extraction and incremental load.
- Designed and developed various jobs for scheduling and running jobs under job sequencer and DataStage Director.
- Performed Administrator functions such as creating projects, setting tunables, protecting project, releasing the jobs, and setting environment variables.
- Extensively implemented import/export utilities for migrating code.
- Documenting and deploying IBM WebSphere DataStage ETL solution and QualityStage Patterns for ensuring Data Quality.
- Worked with configuration file in parallel extender to take maximum benefit from parallel environment.
Environment: Agile, IBM Information Server (DataStage & QualityStage) 9.1, MS Visio,Tectia SSH, Mainframe, IBM DB2,Oracle, ESP Scheduler, Serena Dimensions (Version Control), Hyperion reports
Confidential, Bentonville, AR
DataStage Consultant
Responsibilities:
- Designed and developed Extract, Transform, and Load (ETL) processes for extracting data from various legacy systems and loading into target tables using SQL and DataStage Enterprise Edition.
- Used different Partitioning methods in parallel jobs (Auto, Round-Robin, Hash, and Entire) to improve the performance and to get accurate results according to the business requirements.
- Extensively used DataStage Designer components to design various Parallel jobs in accordance with business specifications.
- Developed jobs in Parallel Extender using different stages like Join, Lookup stage, Netezza Enterprise, DB2 Connector and Enterprise, FTP stage, CFF stage, Copy stage, Filter, Aggregator, Pivot stage and Funnel stages.
- Used DataStage Director to verify logs and monitoring jobs during run session.
- Participated in the daily standup calls/status meetings to update the status about design and development and to discuss about the road blockers.
- Imported Metadata from Oracle database. Imported Metadata definitions into the repository.
- Involved in the preparation of ETL documentation by following the business rules, procedures and naming conventions.
- Performed troubleshooting and tuning of DataStage Jobs for better query performance.
Environment: DataStage Information Server 8.5, IBM Web Console, IBM DB2, Netezza, ERwin, Sun Solaris, SQL, PL/SQL
Confidential, Hartford, CT
ETL Consultant
Responsibilities:
- Analyzed Business Requirements by working closely with Business Analysts.
- Involved in writing transforms, routines for jobs and code for batch jobs.
- Used Parallel Extender to run the jobs in parallel which improved performance for straight bulk loads.
- Solved production and critical problems in ETL applications.
- Developed rerun procedures to overcome fail-over conditions for ETL batch jobs.
- Developed plans for migration tasks.
- Extensively designed and developed DataStage Server/Parallel jobs for data transformation functions.
- Provided on call support for the Production Application System. Created UNIX Shell scripts to automate the process.
- Developed Job sequences for executing DataStage jobs.
- Involved in discussions for the project deliverables including project charter, planning document, requirement specification document, and technical design document preparing the projection implementation plan.
- Designed Error handling process and generated Error reports to be sent as email whenever a job was aborted.
- Developed some Shell, Awk and Sed scripts to perform data audits, to manipulate the data in flat file and to schedule batch jobs.
- Prepared documentation for addressing the referential integrity relations in between the tables at ETL level.
Environment: IBM DataStage 7.5 (Enterprise Edition), ERwin, Oracle, SQL Server, T-SQL, SQL Server Management Studio, AIX UNIX
Confidential, Cranston, RI
ETL Consultant
Responsibilities:
- Developed / designed various new processes and fixed the existing process with new business requirements, various meetings with users’input.
- Designed and developed jobs for extracting, transforming, integrating, and loading data into data mart using DataStage Designer.
- Developed, executed, monitored and validated the ETL DataStage jobs in the DataStage Designer and Director Components.
- Worked with DataStage Director to schedule, monitor, analyze performance of individual stages and run DataStage jobs.
- Extensively used Change Capture, Transformer, Modify, Copy, Join,Funnel, Aggregator,Lookup and Development stages to develop the parallel jobs.
- Generated Surrogate Keys for composite attributes while loading the data into Data Warehouse using Key Management functions.
- Developed user defined Routines and Transformations for implementing complex business logic.
- Developed Job Sequencer and batches and have edited the job control to have jobs run in sequence.
- Imported Metadata from Oracle database. Imported Metadata definitions into the repository. Exported and imported DataStage components using DataStage Manager.
- Involved in the preparation of ETL documentation by following the business rules, procedures and naming conventions.
- Performed troubleshooting and tuning of DataStage jobs for better query performance.
- Reviewed the developed jobs based on the build review checklists.
- Responsible for Unit, System and Integration testing. Developed Test Scripts, Test Plan and Test Data.
Environment: Ascential DataStage 7.5, PVCS Version Controller, Oracle, UNIX
Confidential
DataStage Developer
Responsibilities:
- Involved in creating and maintaining Sequencer and Batch jobs.
- Used DataStage EE 7.5.2 to load data into the Oracle warehouse.
- Involved in development of Job Sequencing using the DataStage Sequencer.
- Used DataStage Designer and Director to schedule and monitor resulting executable versions.
- Created local and shared containers to facilitate ease and reuse of jobs.
- Managed Repository Metadata from DataStage Manager.
- Performing Unit Testing and integration.
Environment: Ascential DataStage, Oracle Citrix, Mercury Test Director