Sr.data Stage Developer Resume
MI
SUMMARY
- Over 7 years of experience in Data modeling, Datawarehouse Design, Development and Testing using ETL and Data Migration life cycle using IBM WebSphere DataStage 8.x/7.x
- Expertise in building Operational Data Store (ODS), Data Marts, and Decision Support Systems (DSS) using Multidimensional Model(Kimball and Inmon),Star and Snowflake schema design.
- Experience in analyzing the data generated by the business process, defining the granularity, source to target mapping of the data elements, creating Indexes and Aggregate tables for the data warehouse design and development.
- Data Processing experience in designing and implementing Data Mart applications, mainly transformation processes using ETL tool DataStage (Ver8.0/7), designing and developing jobs using DataStage Designer, Data Stage Manager, DataStage Director and DataStage Debugger.
- Efficient in all phases of the development lifecycle, coherent with Data Cleansing, Data Conversion, Performance Tuning and System Testing.
- Excellent in using highly scalable parallel processing Infrastructure using DataStage Parallel Extender.
- Efficient in in corporation of various data sources such as Oracle, MS SQL Server, and DB2, Sybase, XML and Flat files into the staging area.
- Experience in Mapping Server/parallel Jobs in DataStage to populate tables in Data warehouse and Data marts.
- Proven track record in addressing production issues like performance tuning and enhancement.
- Excellent knowledge in creating and managing Conceptual, Logical and Physical Data Models.
- Experience in dimensional and relational database design.
- Strong in Data warehousing concepts, dimensional Star Schema and Snowflakes Schema methodologies.
- Expert in unit testing, system integration testing, implementation, maintenance and performance tuning.
- Experience in different Scheduling tools like AutoSys for automating and scheduling jobs run.
- Excellent with PL/SQL, T - SQL, Stored Procedures, Database Triggers and SQL * Loader.
- Experience in UNIX Shell Scripting.
- Excellent knowledge of operating systems Windows, UNIX, Macintosh, and databases including Oracle, SQL Server,and DB2.
- Experience in implementing Quality Processes like ISO 9001:2000/Audits.
- Detail oriented with good problem solving, organizational, analysis, highly motivated and adaptive with the ability to grasp things quickly.
- Ability to work effectively and efficiently in a team and individually with excellent interpersonal, technical and communication skills.
TECHNICAL SKILLS
ETL Tools: IBM Infosphere Datastage Enterprise Edition 8.5/ 8.1/7.5/7.1 (Manager, Designer, Director, Version Control, Administrator), IBM Information Server 8.1, IBM Information Analyzer 8.1
Database: Oracle 11i/9i/8i/8.0/7.0, SQL Server 2000, DB2 UDB 8.0, MS Access 2003
Reporting Tool: Business Objects 5.0
Languages: PL/SQL, Shell Script, C/C++, XML
Data Modeling Tools: Erwin 4.0/7.0
Operating Systems: IBM UNIX AIX 4.2, Solaris SunOS5.9, Linux, Windows 2000/NT/XP/Vista
Tools: TOAD 8.0, Win SQL, MS Office 2003, Command Center/Control Center
PROFESSIONAL EXPERIENCE
Confidential, MI
Sr.Data Stage Developer
Responsibilities:
- Used various sourced and designed and developed jobs in order to facilitate the integration process of the expenditures.
- Analyzed reports made by business user to extract data to effectively integrate expenditures from various sources.
- Developed number of staging jobs to load the data into target table, created number of complex jobs in the life cycle and also was involved in migration, error management, reporting, and data quality issues.
- Worked with widely used stages like Flat File, Lookup, Join, Pivot, Transformer, Sort, Aggregator, Merge, Row Generator, and Column Generator and also troubleshooted the designed jobs and tested the jobs for all logical errors.
- Evaluated the Data extraction from source to confirm for data irregularities and classify corrupt data to apply suitable transformation in the mappings in the jobs.
- Used diverse partitioning methods like Auto, Hash, Same, Entire etc. and were also involved in preparation of Test Cases concerned with Unit testing, System testing to check data reliability.
- Used Datastage to load data from Oracle, DB2 and Flat Files, also developed SQL scripts to supplement the ETL process and to verify on final target data and developed PL/SQL programs and Unix Scripts.
- Involved in Data Extraction, Transformation and Loading (ETL process) from Source to target systems using Data stage.
- Broadly involved in data modeling support for many strategic application development projects and planned and developed Business application.
- Worked extensively with sequencer jobs stages such as job activity stage and terminator activity stage to run various parallel jobs.
Environment: IBM InfoSphere DataStage 8.1 (Designer, Director, Administrator), Oracle 10g/9i, UNIX, Shell Scripting, Erwin 7.0, Qualitystage, Metastage, Info Analyzer, TOAD, PL/SQL, Autosys, MS Access.
Confidential, Harrisburg, PA
DataStage Developer
Responsibilities:
- Designed DataStage jobs for the extraction, transformation and loading of data into the IDS and HDS Data warehouses.
- Identified source systems, their connectivity, the related tables and fields and ensured that data was appropriate for mapping.
- Generated Surrogate ID’s for the dimensions in the fact table for indexing and faster access of data.
- Used Stage Variables, Transforms, Subroutines of DataStage Jobs.
- Worked with parallel job message handler for demoting or removing warning messages from the job log.
- Imported and exported repositories across different servers using DataStage Manager.
- Participated in Design and implementation of Star and Snowflake schemas.
- Worked with offshore teams dealing with production requirements.
- Improved job performance by gathering the performance statistics of the jobs in the production server using DataStage Director.
- Extensively used the Sequential File stage, Complex Flat File Stage, Hashed File Stage, Modify, DataSet, Filter, Funnel, Join, Lookup, Merge, Copy, Aggregator, and Change Capture during ETL development.
- Simplified the existing transformer logic in the jobs to make them perform better.
- Worked on different partitioning methods (Round Robin, Entire, Hash by Field, Modulus and Range) and collection methods (Round Robin, Order and Sort Merge) in processing sequential data.
- Participated in the review of Technical, Business Transformation Requirements Documentation.
- Trouble-shoot the designed jobs using the DataStage Debugger.
- Created job sequences and job schedules to automate the ETL process using Autosys.
- Worked with DataStages Teradata connector, Teradata Enterprise and Teradata Multiload.
- Documented the changes made to the jobs, the comparative performances of the old and new jobs and the test plan that was used to test the jobs.
- Monitored and resolved the ticketing issues that arouse during the implementation of the jobs.
- Used UNIX shell scripts and commands to run the jobs.
Environment: IBM InfoSphere DataStage 8.1, Designer, Director, MetaStage, QualityStage, Flat File, Administrator, Information Analyzer, Windows XP, UNIX, Oracle 10g, SQL *Plus, Teradata, Toad, ODBC, DB2, Autosys, Erwin 4.5.2.
Confidential, Chicago, IL
DataStage ETL Analyst
Responsibilities:
- Designed/wrote the tech specs (Source-Target mappings) for the ETL mappings along with the Unit Test scripts.
- Involved in migration of DataStage projects and jobs from earlier versions to IBM InfoSphere 8.1 version.
- Used AutoSys to schedule jobs and e-mailed the status of ETL jobs to operations team daily.
- Used IBM Infosphere Federation Server to in corporate data from multiple data sources into reports and analytics with a single query.
- Used Director Client to validate, run, schedule and monitor the jobs that are run by IBM Infosphere DataStage server.
- Used DataStage Designer to develop parallel jobs to extract, cleanse, transform, integrate and load data into Data Warehouse.
- Used DataStage Director to schedule, monitor and analyze DataStage jobs.
- Developed jobs in Ascential Parallel Extender PX using different stages like Transformer, Aggregator, Lookup, Join, Merge, Modify, Remove Duplicate, Oracle Stage, Sort, Peek, Row Generator, Column Generator, Sequential File and Data Set.
- Designed DataStage sequences to specify Job execution order.
- Loaded data into staging area and then into Data Marts.
- Worked as a Datastage administrator to perform routine administrative roles.
- Imported and exported Repositories across DataStage projects using DataStage Manager
- Unit tested DataStage Jobs in development including creating the appropriate test data.
Environment: Ascential DataStage 7.5, ProfileStage, QualityStage, MetaStage, Erwin, Oracle11g, DB2, PL/SQL, Toad, Solaris and Windows XP, AutoSys
Confidential, New York, NY
Data Stage Developer
Responsibilities:
- Involved in all phases of project which includes analysis, design, development, coding, testing, implementation, support and documentation.
- Interacted with end users in finalizing the requirements and documented the Program Specifications for the ETL jobs.
- Designed Data Stage Jobs to extract data from various source systems, Transformation and conversion of data, load data to data warehouse.
- Designed Jobs using stages such as complex flat file, shared containers, surrogate key generators and also used DataStage Director to schedule Jobs.
- Developed Sequence jobs using DataStage Sequencer and dependency analysis (which indicates weather job ran sequentially or in Parallel).
- Used several data stage Sequence stages like Job Activity, Execute command, Sequencer, Exception Handler, Wait for a file activity, Terminator activity and Notification activity.
- Involved in Quality stage for cleansing data, worked with address verification stage for cleansing address data.
- Experience developing complex transformations, surrogate keys, routines, dimension tables and fact tables and using parameterized metadata for sequential file (schema file), loaded into multiple tables using a single DataStage job.
- Used Stage Variables and Routines for developing Parameter Driven Jobs and debugging them.
- Closely associated with teammates, source side developers, managers, client side Data Analysts, and end users for the best implementation of the jobs.
Environment: Data Stage 7.0(Designer, Director, Administrator), Erwin, Oracle 9i, Teradata, UNIX, MS Access.