Senior Datastage Consultant Resume
Southbury, CT
SUMMARY
- Over 10+ years of experience in Systems Analysis, Design, Development, Testing and implementation of Data Warehouse applications.
- Extensive ETL tool experience using IBM InfoSphere / WebSphere DataStage and Quality Stage
- Worked on Datastage client tools like Datastage Designer, Datastage Director and Datastage Administrator.
- Good Knowledge about Data Warehouse, Data marts, OLTP, OLAP, Dimensional Modeling, Fact tables, Dimension tables and Star/Snowflake schema modeling.
- Excellent in using highly scalable parallel processing infrastructure using parallel jobs with multi - node configuration files.
- Experienced in scheduling sequence, parallel and server jobs using Datastage Director, UNIX scripts and Autosys scheduling tools.
- Designed and developed parallel jobs, server and sequence jobs using Datastage Designer.
- Experience in using different types of stages like Transformer, Aggregator, Merge, Join, Lookup, Sort, Copy, Removeduplicate, Funnel, Filter, Pivot, Shared containers for developing jobs.
- Worked and extracted data from various data sources such as Oracle, DB2, Teradata, XML and Flat files.
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing and Performance Testing.
- Created local and shared containers to facilitate ease and reuse of jobs.
- Proven track record in addressing production issues like performance tuning, enhancement and memory issues.
- Imported the required Metadata from heterogeneous sources at the project level.
- Used the Data Stage Director and its run-time engine to schedule running the solution, testing and debugging its components, and monitoring the resulting executable versions (on an ad hoc or scheduled basis).
- Experience in Production Support, extensively worked on production support issues.
- Giving the value additions to the project by preparing the understanding documents and providing Knowledge transfer to the new resources in the project regarding the functional and technical aspects of the project flows
- Quick learner and adaptive to new and challenging technological environments.
- Project Management experience with excellent problem-solving, organization and leadership skills.
TECHNICAL SKILLS
ETL Tools: IBMInfoSphereDatastage 11.5/9.1/7.5.2/7.1.1 , Parallel Extender
Reporting Tools: Oracle Essbase, Business Objects
Databases: Teradata, SQL Server, Oracle, DB2 on Mainframe
Operating Systems: Windows, Confidential AIX, SUSE Linux
Data Modeling Method: Logical, Physical Data Modeling, Star-Schema Modeling
Languages: SQL, PL/SQL, C, C++,Korn/Bash Shell Scripting
Scheduler: Autosys, Cron, Control M.
PROFESSIONAL EXPERIENCE
Confidential, Southbury, CT
Senior DataStage Consultant
Responsibilities:
- Analyze and provide ETL architecture for all Invoices stream processing that runs periodically.
- Design and develop reusable components for data sourcing, cleansing, and integration related to all ETL processes.
- Define programming standards, procedures and best practices in support of ETL processes.
- Assist the business analyst in mapping the data from source to target and prepare prototype models to support the development efforts
- Extensive experience in analysis and design of database including ER Diagrams and Normalization techniques.
- Implemented data staging, cleansing and transformation mechanisms, normalized databases, Star/ Snowflake schemas, and Business Intelligence delivery mechanisms as required, including loading of data marts.
- Develop complex ETLs by usingDataStageparallel extender parallel processing capabilities.
- Help team members in tuning and improve performance for handling high-volume ETL jobs
- Extensive working experience in various industries like TAX Invoice Auditing.
- Experienced in scheduling sequence, parallel and server jobs usingDatasStage Director, UNIX scripts and scheduling tools.
- Developed parallel jobs using different processing stages like Transformer, Aggregator, Lookup, Join, Sort, Copy, Merge, Funnel, CDC, Change Apply and Filter.
- Experienced in creating Sequence files and XML Files as outputs for submissions.
- Experience in integration of various data sources like DB2, MS Excel and Flat files into the Staging Area.
- Imported the required Metadata from heterogeneous sources at the project level.
- Experienced in scheduling Jobs using AutoSys.
- Resolve technical and functional issues with the design through test phase of development
- Help team members in tuning and improve performance for handling high-volume ETL jobs.
- Work closely with project management, analysts, data modelers and the existing ETL Developers and Designers andto understand project requirements and contribute to the project solution
- Experience in resolving Data Transformation, Cleansing and Capturing rejects and Exception and Error reporting.
- Good experience in Transforming Business specific rules into functional Specs.
- Experience in Production Support, extensively worked on production support issues and resolved them using session logs, workflow logs, and used e-mail task for capturing issues via e-mail along with the session logs.
- Ability to work autonomously and as part of a team under tight deadlines to meet any project expectations.
- Co ordinating with Offshore teams for getting the Business requirements and Deliverables as well to be completed timely.
Environment: Confidential Infosphere Data Stage and Quality Stage11.5, DB2, Java, MS-Access, Unix Shell scripts, PUTTY, WinSCP, ERwin R8.1, HP Quality Center, AutoSys,QMF Work Station, Eclipse.
Confidential, Boston, MA
Senior DatastageConsultant
Responsibilities:
- Involved in complete Software Development Life Cycle (SDLC) of various projects, including requirements gathering, system designing, data modeling, ETL development, production enhancement, support and maintenance.
- Extensive experience in analysis and design of database including ER Diagrams and Normalization techniques.
- Experience with Star and Snowflake Schema, Data Modeling, Fact and Dimensional Tables and Slowly Changing Dimensions.
- Experience includes working in various industries like Financial, Supply chain management, Healthcare, Insurance and Retail.
- Experienced in scheduling sequence, parallel and server jobs usingDatastageDirector, UNIX scripts and scheduling tools.
- Extensively worked on performance tuning and backup on databases like Oracle 11g/10g/9i/8i.
- Developed parallel jobs using different processing stages like Transformer, Aggregator, Lookup, Join, Sort, Copy, Merge, Funnel, CDC, Change Apply and Filter.
- Experience in integration of various data sources like Teradata, Oracle, DB2, SQL Server, MS Access, Sybase, Informix, MS Excel and Flat files into the Staging Area. Extensively worked with materialized views and TOAD.
- Imported the required Metadata from heterogeneous sources at the project level.
- Good experience in scheduling Jobs using AutoSys, Tivoli, Zeke and Crontab.
- Knowledge in using PL/SQL to write stored procedures, functions, and triggers.
- Generated Surrogate IDs for the dimensions in the fact table for indexed and faster access of data in server jobs.
- Created local and shared containers to facilitate ease and reuse of jobs.
- Experience in resolving Data Transformation, Cleansing and Capturing rejects and Exception and Error reporting.
- Extensively worked on SAP MDM Pack to deliver data to online analytical processing (OLAP).
- Good knowledge on reporting tools like, Business Objects, and Oracle Oracle Essbase.
- Good experience in Transforming Business specific rules into functional Specs.
- Experience in Production Support, extensively worked on production support issues and resolved them using session logs, workflow logs, and used e-mail task for capturing issues via e-mail along with the session logs.
- Working experience in interacting with business analysts and developers to analyze the user requirements, functional specifications and system specifications.
- Ability to work autonomously and also as part of a team under tight deadlines so as to meet any project expectations.
Environment: Confidential Infosphere Data Stage11.5/9.1, Microsoft SQL 2012/2008, AIX6.0, Oracle 11g, Toad 9.5, Java, MS-Access, shell scripts, PUTTY, WinSCP, ERwin R8.1, HP Quality Center, AutoSys
Confidential, Miami, FL
Senior DatastageConsultant
Responsibilities:
- Involved in data profiling, modeling, planned and designed the batch/runs as per requirements.
- Designed and developed the ETL jobs using Parallel Edition, which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Preparation of technical specification for development of Extraction, Transformation and loading (ETL) jobs to load into various tables in DataMart.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Involved in development of UNIX shell script for Batch jobs run.
- Responsible for generation of DDL statements, which are executed, for database creation.
- Designed the ETL jobs using Confidential InfoSphereDataStage8.5 to extract, Transform and load the data into staging, ODS and EDW.
- Responsible for preparing Physical/logical data models.
- Responsible for data analysis, requirements gathering, report analysis, source-to-target mapping, frequency analysis, process flow diagrams, and documentation.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
- Used Information Analyzer for Column Analysis, Primary Key Analysis and Foreign Key Analysis.
- Effectively worked on Quality Stage for standardization and cleansing the data.
- Performed the Integration and System testing on the ETL jobs.
- Responsible for preparing ad hoc jobs.
- Used Teradata PMON to monitor the performance by using explain plan by tuning many SQL queries.
- Designed and Created data cleansing, data conversion, validation and External loading scripts like MLOAD and FLOAD for Teradata warehouse using Datastage ETL tool.
- Migrated projects from development to QA to Production environments.
- ETL code peer review.
- Assisted operation support team for transactional data loads in developing SQL & UNIX scripts.
- Imported the required Metadata from heterogeneous sources at the process level.
- Created Job Parameters and Environment variables to run the same job for different sources and targets.
- Designed a job template that provides the Environmental parameters for the subsequent use in the projects.
- Created Shared Containers for Re-using the Business functionality.
- Collaborated with BO team to design Crystal Reporting and reports for enterprise reporting applications.
- Worked with Developers to troubleshoot and resolve issues in job logic as well as performance.
Environment: Confidential InfosphereDatastage9.1/8.5 (Administrator, Designer, Director), Microsoft SQL 2008, Oracle 10g, Teradata, MS-Access, shell scripts, Putty, WinSCP, Quality Center, Confidential Rational Rose, AutoSys
Confidential, Alpharetta, GA
Data Stage Developer
Responsibilities:
- Involved in complete Software Development Life Cycle (SDLC) of various projects, including requirements gathering, system designing, data modeling, ETL development, production enhancement, support and maintenance.
- Extensive experience in analysis and design of database including ER Diagrams and Normalization techniques.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Responsible for generation of DDL statements, which are executed, for database creation.
- Collaborated in Extraction of OLAP data from SSAS using SSIS.
- Collaborated with BO team to design SSRS Reporting and reports for enterprise reporting applications.
- Extensively used SQL Server Integration and Reporting Services SSIS and SSRS.
- Developed Reports using SQL Server Reporting Services (SSRS) and SSIS packages and designing ETL processes.
- Worked with Developers to troubleshoot and resolve issues in job logic as well as performance.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
Environment: Confidential InfoSphereDatastage8.5 (Administrator, Designer, Director),Microsoft SQL 2005/2008, Oracle 10g, Java, MS-Access, shell scripts, PUTTY, WinSCP, Mercury Quality Center, PVCS, AutoSys
Confidential, Richmond, VA
Data Stage Developer
Responsibilities:
- Extensively used Datastagefor extracting, transforming and loading databases from sources including Oracle, DB2 and Flat files.
- Collaborated with EDW team in, High Level design documents for extract, transform, validate and load ETL process data dictionaries, Metadata descriptions, file layouts and flow diagrams.
- Collaborated with EDW team in, Low Level design document for mapping the files from source to target and implementing business logic.
- Generation of Surrogate Keys for the dimensions and fact tables for indexing and faster access of data in Data Warehouse.
- Tuned transformations and jobs for Performance Enhancement.
- Extracted data from flat files and then transformed according to the requirement and Loaded into target tables using various stages like sequential file, Look up, Aggregator, Transformer, Join, Remove Duplicates, Change data capture, Sort, Column generators, Funnel and Oracle Enterprise.
- Created Batches (DS job controls) and Sequences to control set of jobs.
- Extensively used DatastageChange Data Capture for DB2 and Oracle files and employed change capture stage in parallel jobs.
- Executed Pre and Post session commands on Source and Target database using Shell scripting.
- Collaborated in design testing using HP Quality Center.
- Extensively worked on Job Sequences to Control the Execution of the job flow using various Activities & Triggers (Conditional and Unconditional) like Job Activity, Wait for file, Email Notification, Sequencer, Exception handler activity and Execute Command.
- Collaborated in Extraction of OLAP data from SSAS using SSIS.
- Extensively used SAP R/3 and SAP BW packs
- Collaborated with BI and BO teams to find how reports are affected by a change to the corporate data model.
- Collaborated with BO teams in designing dashboards and scorecards for Analysis and Tracking of key business metrics and goals.
- Utilized Parallelism through different partition methods to optimize performance in a large database environment.
- Developed DS jobs to populate the data into staging and Data Mart.
- Executed jobs through sequencer for better performance and easy maintenance.
- Performed the Unit testing for jobs developed to ensure that it meets the requirements.
- Developed UNIX shell scripts to automate file manipulation and data loading procedures.
- Scheduled the jobs using AutoSys, Tivoli and Crontab.
- Collaborated in developing Java Custom Objects to derive the data using Java API.
- Responsible for daily verification that all scripts, downloads, and file copies were executed as planned, troubleshooting any steps that failed, and providing both immediate and long-term problem resolution.
- Provided technical assistance and support to IT analysts and business community.
Environment: Confidential WebSphereData stage8.0.1a (Administrator, Designer, Director), Microsoft SQL 2005/2008, Microsoft SQL 2008, Oracle 10g, MS Access, shell scripts, PUTTY, WinSCP, HP Quality Center, Confidential Rational Rose, AutoSys.
Confidential, Milwaukee, WI
Data StageDeveloper
Responsibilities:
- Involved in various roles of Administrator and Developer throughout the project.
- Conducted the training sessions for other ETL developers on the best practices as well as performance improvement techniques.
- Extensively usedDatastagefor extracting, transforming and loading databases from sources including Oracle, DB2 and Flat files.
- Managed analysis, Design, coding and testing of ETL jobs for 7 Source Systems.
- Involved in implementing the Best practices and design standards. The Best practices include Restart-ability, Recovery, Parameter standardization and Capacity planning, etc.
- Collaborated with EDW team in, Low Level design document for mapping the files from source to target and implementing business logic.
- Generation of Surrogate Keys for the dimensions and fact tables for indexing and faster access of data in Data Warehouse.
- Used Partition methods and collection methods for implementing parallel processing.
- Developed complexData stagejobs according to the business requirements / mapping documents.
- Created Batches (DS job controls) and Sequences to control set of jobs.
- Collaborated in design testing using HP Quality Center.
- Performed Unit Testing, System Integration Testing and User acceptance testing.
- Extensively Designed local containers and shared containers to simplify and modularize job design by replacing complex logics with single container stage and also to promote reusability of job designs.
- Involved in importing and exporting jobs category wise and maintaining the backup regularly.
- Collaborated with BO teams in designing dashboards and scorecards for Analysis and Tracking of key business metrics and goals.
- Used designer and director to schedules and monitor jobs and to collect the performance statistics.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several perational source files. Created some routines (Before-After, Transform function) used across the project.
- Involved in creating UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Responsible to tune ETL processes to optimize load and query performance.
- Created standards document, best practices guide and performance tuning techniques documents.
Environment: AscentialDatastage 7.5.1, Microsoft SQL 2005/2008, Microsoft SQL 2008, Oracle 11g, Toad 9.5, Java, MS Access, SAP BW, SAP MDM, AS/400, shell scripts, PUTTY, WinSCP, ERWIN 4.0, HP Quality Center, Tivoli, Confidential Rational Rose, AutoSys
Confidential, Pittsburgh, PA
Data Stage Developer
Responsibilities:
- Part of design team and production Support team for the conversion project.
- Part of a design team for design of STAR schema for data warehouse project.
- Interacted with the End users / Customers for Creating Mapping documents.
- Created Mapping documents for Migration project from MVS to Oracle databases.
- Involved in implementing the Best practices and design standards. The Best practices include Restart-ability, Recovery, Parameter standardization and Capacity planning, etc.
- Done extensive business analysis to analyze the source system and talking to the business groups to understand the reporting requirements.
- Analyzed the source system, which is on MVS, COBOL, and DB2 to form the business rules.
- Designed the mapping documents between source databases and target databases.
- Designed and developed Customer mart and Sales mart using the data from the centralized data warehouse using top-down approach.
- Worked on critical Occurs and Redefines in the complex flat file structures from COBOL.
- Done data analysis, quality analysis, and data loading.
- Worked within a team to populate Type I and Type II slowly changing dimension tables from several operational source files. Created some routines (Before-After, Transform function) used across the project.
- Developing processes for extracting, cleansing, transforming, integrating and loading data into databases.
- Created extract processes, analyzing the MVS-Cobol, DB2 code to pull the required data.
- Developed manyData stageserver jobs for data processing and loading of data.
- Developed Load jobs Oracle and DB2databases.
- Used TOAD tool for the analysis part.
- Used Autosys for Scheduling of the Jobs.
Environment: Data stageEnterprise Edition 7.1 Parallel Extender, ProfileStage, MetaStage, Oracle 9i, Sybase, AIX, PL/SQL, Business Objects 6.5, OLAP, ERwin5.5, SQL plus, SQL*Loader, Sun Solaris 8.0.