Datastage Developer Resume
Chicago, IL
SUMMARY
- Over 6+ years of experience in data modeling, data warehouse, development and testing ETL using IBM WebSphere/InfoSphere DataStage 8.5/8.1/8.0.1/7.5
- Experience in extracting and integrating data from various database sources like Oracle 11g/10g/9i, flat file, and SQL server
- Proficient in data analysis, database design and data migration
- Program in SQL to perform the data query, extract, transform and load functions
- Experience in DataStge client components (Designer, Manger, Administrator, and Director)
- Experience in mapping parallel jobs in DataStage to populate tables in data warehouse and data marts.
- Proven track record in addressing production issues like performance tuning and enhancement.
- Excellent knowledge in creating and managing Conceptual, Logical and Physical Data Models.
- Knowledge in development, debugging, troubleshooting, monitoring and tuning using DataStage tools.
- Worked with different file stages of DataStage like Sequential, Dataset, File set, and Lookup File set
- Develop parallel jobs with different processing stages of DataStage like Filter, Sort, Aggregator, Copy, Transformer, Funnel, Join, Look Up, Merge, Pivot, Modify, Switch, Change Capture, Surrogate Key Generator, and Remove Duplicates
- Worked extensively with different development/debug stages of DataStage like Head, Tail, Row Generator, and Column Generator, Peek
- Developed efficient mappings for data extraction/transformation/loading (ETL) from different sources to a target data warehouse
- Dynamic, alert, and capable of producing high quality work under time constraints.
- Excellent in using highly scalable parallel processing Infrastructure using DataStage Parallel Extender
- Strong in Data Warehousing concepts, dimensional Star Schema and Snowflakes Schema methodologies
- Extensively Worked on Job Sequences to control the flow of job execution using various activities like Job Activity, Email Notification, Sequencer, Routine activity and Exec Command
- Excellent experience in Relational database (RDBMS), Oracle 11g/10g/9i, Microsoft SQL Server, Teradata Load, SQL Loader.
- Expert in working in various operating systems like UNIX and Windows
- Detail oriented with good problem solving, organizational, analysis, highly motivated and adaptive with the ability to grasp things quickly.
- Hardworking, ability to work effectively and efficiently in a team and individually with excellent interpersonal, technical and communication skills.
TECHNICAL SKILLS
ETL tools: DataStage 8.5/8.1/7.5 (Administrator, Manager, Designer, Director, Parallel Extender/Orchestrate and Server Jobs, Quality Stage/Integrity, Metastage), Data Warehousing Designer, SQL* Plus, SQL*Loader
Data Modeling: Dimensional Data Modeling, Data Modeling, Star Join Schema Modeling, Snow - Flake Modeling, FACT and Dimensions Tables, Physical and Logical Data Modeling, Erwin, Oracle Designer.
Databases: Oracle 11g/10g/9i, MS SQL Server, DB2, MS Access
Programming: Unix Shell Scripting, SQL, PL/SQL
Other Tools: Autosys, Microsoft Visio and Microsoft Project, TOAD
Operating Systems: UNIX, LINUX, WINDOWS XP/7/8
PROFESSIONAL EXPERIENCE
Confidential - Chicago, IL
DataStage Developer
Responsibilities:
- Involved in development phase meetings for Business Analysis and Requirements Gathering.
- Developed DataStage ETL jobs for extracting data from multiple sources.
- Analyzed the given source Dimensions and target Fact table structures to develop Surrogate Key tables referencing required dimensions.
- Developed various jobs using Hashed file, Aggregator, Sequential file, Transformer stages, Change Capture
- Developed Parallel jobs using Stages, which includes Join, Transformer, Sort, Merge, Filter and Lookup.
- Used Shared Containers and created reusable components for local and shared use in the ETL process.
- Imported and exported Repositories across DataStage projects.
- Analyzed data with discrepancies through Error files and Log files further data processing and cleansing.
- Used DataStage Director Runtime engine to schedule and execute developed jobs and job sequences, and use Log events to monitor job progress and performance.
- Participated in weekly status meetings, and conducting internal and external reviews as well as formal walkthroughs among various teams, and documenting the proceedings.
Environment: IBM Infosphere DataStage 8.5 (Manager, Director), Oracle 10g, MYSQL Server, Windows, DB2, Unix Shell Scripts, Autosys 4.5
Confidential - Detroit, MI
DataStage Developer
Responsibilities:
- Designed the ETL jobs using IBM Infosphere DataStage 8.0.1 to Extract, Transform and load the data into staging, ODS and EDW.
- Designed and developed the ETL jobs using Parallel Edition, which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Responsible for preparing Physical/logical data models.
- Responsible for data analysis, requirements gathering, report analysis, source-to-target mapping, frequency analysis, process flow diagrams, and documentation.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Designed sequence jobs using the activities such as Job Activity, Nested Condition, Notification Activity, Sequencer Activity, Terminator Activity and Execute Command.
- Performed the Integration and System testing on the ETL jobs.
- Translated business rules and functionality requirements into ETL derivations.
- Scheduled jobs using Autosys scheduler utility based on the requirements and monitored the production processes closely for any possible errors.
- Migrated projects from development to QA to Production environments.
- Assisted operation support team for transactional data loads in developing SQL & UNIX scripts.
- Imported the required Metadata from heterogeneous sources at the process level.
- Created Job Parameters and Environment variables to run the same job for different sources and targets.
- Used Multi-job-compiler during deployment of projects.
- Created Batches (DataStage job controls) and Sequences to control set of jobs.
- Used the Director to schedule running the job, testing and debugging its components, and monitoring.
- Created Shared Containers for Re-using the Business functionality.
- Collaborated with BO team to design Crystal reporting and reports for enterprise reporting applications.
- Worked with Developers to troubleshoot and resolve issues in job logic as well as performance.
Environment: IBM Infosphere DataStage 8.0.1, Erwin 4.0, Oracle 10i, SQL Server 2000, PL/SQL, SQL*Loader, SQL*plus, Business Objects 5.X, Windows NT.
Confidential - Southfield, MI
DataStage Developer
Responsibilities:
- Involved in meetings with the users to gather and analyze the requirements.
- Worked closely with the Data analyst and business analyst during the design and development of ETL technical specification document.
- Used DataStage as an ETL to extract data from sources like flat files and DB2 and loaded to target DB2 UDB.
- Extensively used parallel stages like Aggregator, Sort, Column Generator, Join, Merge, Head and peek for development and de-bugging purposes.
- Developed jobs to standardize and improving error handling.
- Fixing various bugs in the applications by changing the design after analysis.
- Understanding the business rules and changing the application for performance and stability and adhering to business rules.
- Gathering the metadata definition of the source systems, preparing Transformation rules according to the business requirements for new enhancements.
- Good experience in working as coordinator to offshore team from Onshore under ETL, DataStage, environment.
- Created reusable components like parameter sets.
- Involved in the performance tuning during historical and Daily loads and reduced the batch window and also in making the application Robust.
- Experience in deploying new applications into the production environment.
- Good experience in creating various reusable jobs as part of enhancements, data requests and production fixes.
- Experience in handling Transition activities of the new applications from development team to support team.
- Involved in importing and exporting jobs category wise and maintaining the backup regularly.
Environment: IBM InfoSphere DataStage8.0, Oracle 10g, MS Visio, MS Project, MS Office Suite, Teradata, Windows XP, Aqua Data Studio, Flat Files, SQL Developer, Unix Shell Scripting, Linux (Red Hat).
Confidential -Olympia, WA
DataStage Developer
Responsibilities:
- Involved in development phase meetings for Business Analysis and requirement gathering.
- Developed parallel jobs to load the data from flat files, text files, and MS SQL.
- Suggested various changes in the physical model to support the business requirements.
- Set various parameters in DataStage Administrator for performance tuning.
- Used DataStage Designer to develop various jobs.
- Utilized shared containers for code reusability and for implementing the predefined business logic.
- Created and scheduled the job sequences by checking job dependencies.
- Wrote complex SQL queries using joins, sub queries and correlated sub queries.
- Performed Unit Testing to check the validity of the data at each stage.
- Used DataStage Director to debug the jobs and to view the error log to check for errors.
- Implemented best practices in the development environment (code standards, code migration).
- Wrote PL/SQL stored procedures, functions and packages and triggers to implement business rules into the application.
Environment: Datastage 8.0.1, Windows 7, Linux, DB2, UNIX, IBM Mainframe, Teradata, Data Set, Flat Files, Sequential files, SQL Loader, Toad, Autosys, SQL, PL/SQL, Oracle SQL Plus, UNIX Shell Scripting, Erwin
Confidential - Canton, MI
DataStage Developer
Responsibilities:
- Used various sourced and designed and developed jobs in order to facilitate the integration process of the expenditures.
- Analyzed reports made by business user to extract data to effectively integrate expenditures from various sources.
- Developed number of staging jobs to load the data into target table, created number of complex jobs in the life cycle and also was involved in migration, error management, reporting, and data quality issues.
- Worked with widely used stages like Lookup, Join, Pivot, Transformer, Sort, Aggregator, Merge, Row Generator, and Column Generator and also troubleshoot the designed jobs and tested the jobs for all logical errors.
- Evaluated the Data extraction from source to confirm for data irregularities and classify corrupt data to apply suitable transformation in the mappings in the jobs.
- Used diverse partitioning methods and were also involved in preparation of test cases concerned with Unit testing, System testing to check data reliability.
- Used DataStage to load data from Oracle, DB2 and Flat Files, also developed SQL scripts to supplement the ETL process and to verify on final target data and developed PL/SQL programs and Unix Scripts.
- Involved in Data Extraction, Transformation and Loading (ETL process) from Source to target systems using DataStage.
- Broadly involved in data modeling support for many strategic application development projects and planned and developed Business application.
- Worked extensively with sequencer jobs stages such as job activity stage and terminator activity stage to run various parallel jobs.
Environment: IBM Websphere Datastage 7.5 (Designer, Director, Administrator), Oracle 10g/9i, UNIX, Shell Scripting, Metastage, PL/SQL, MS Access.
Confidential - VA
DataStage Developer
Responsibilities:
- Design Datastage sequences to specify Job execution order.
- Redesigned few jobs in Datastage Designer to meet the changes in new incoming feeds.
- Coordinated with client managers, business architects and data architects for various sign offs on data models, ETL design docs, testing docs, migrations and end user review specs.
- Design of Logical and Physical Data Models and developed the STAR Schema models
- Multi source Extraction Transformation and Loading into the Repository using Data Stage.
- Developed various Data extractions, Transformations rules based on Business requirements.
- Designed ETL jobs to identify and remove duplicate rows using remove duplicate stage.
- Code review of several mappings which could have a performance impact.
- Created and modified all the jobs and shell scripts with new email Id’s and groups for abort and failure notifications.
- Involved in unit, system and end-to-end testing of the design.
- Actively involved in everyday job monitoring and resolved all the issues which are related to job aborts and job failures.
- Worked on call for production support.
- Assist in planning and installing Datastage upgrades and project implementation.
- Ensure that the system continues to perform to maximum efficiency after the go-live with minimum disruption.
Environment: IBM Websphere Datastage 7.5, Oracle 9i, Windows XP, MS Office, UNIX