Etl/ Datastage Developer Resume
Medina, OH
SUMMARY
- Over 7+ years of experience in IT Industry especially in Client/Server business systems, Data Warehouse and Decision Support Systems (DSS) Analysis, Design, Development, Testing and Implementation and Production Support of Data Warehousing applications.
- Have a good experience with Extraction Transformation and Loading tool using DataStage 11.5,8.7, 8.5, 8.0.1, 7.5.2, 7.1 (Server and Parallel Editions).
- Involved in Design, Development, Implementation, Testing and Support for variety of ETL related tasks on Data warehouses, ODS, data Migration and Interfaces using IBM DataStage (Server & Parallel Extender), Quality Stage.
- Experienced in DataStage client tools like DataStage Designer, DataStage Director, DataStage Manager.
- Strictly followed SDLC and contributed for deliverable including Technical and non - Technical documents.
- Experience in Integration of various data sources like Oracle, DB2UDB, SQL Server, SAP/R2, XML and Ms-Access. Involved in writing database procedures and functions and fine tuning SQL for performance optimization.
- Transformed data from one server to other server using tools like SQL Server Integration Services (SSIS).
- Created derivations and business rules to be used by ETL for mapping source data for the population of the Warehouse and Data Marts.
- Good experience in writing UNIX Shell Scripts including Scripts to automate file manipulation and data loading procedures.
- Strong Knowledge of Data Warehouse Architecture, STAR Schema, Snow flake Schema, FACT and Dimensional Tables.
- In depth experience in dealing with DataStage Designer stages like Lookup, Join, Merge, Row generator, Transformer, SCD, Remove Duplicate, Sort, Peek, Change capture, Filter, CFF, Copy, Sequential File, FTP, Data Set, ODBC, etc.
- Extensively worked with Parallel Edition for splitting bulk data into subsets to distribute the data to all available processors to achieve best job performance.
- Hands on experience in end-to-end of Data warehousing ETL routines, which includes writing custom scripts, data mining and data quality process.
- Used IBM QualityStage to design data quality assessments, matching, and standardization processes.
- Extensively used Job Activities, Email Notification, and Nested Condition, Sequencer and Terminator activities in Job Sequences.
- Experience in using AutoSys Scheduler Tool to schedule DataStage jobs.
- Performed Debugging, troubleshooting, monitoring and performance tuning using DataStage.
- Application Data warehousing experience including Financial, Retail and Insurance Domains. Proven ability to implement technology based solutions for business problems.
- Worked with various data structures.
- Excellent communication skills, good organizational skills, outgoing personality, self-motivated, hardworking, ability to work independently or cooperatively in a team, eager to learn, ability to grasp quickly.
TECHNICAL SKILLS
ETL Tools: IBM Info sphere DataStage 11.5/8.7/8.5/8.0.1/7.5
Operating Systems: UNIX, Windows NT/ 2000/XP, Windows7 OS, Ascential DataStage 7.5.2
Database: Oracle11g/10g/9i/8i, MS Access, DB2, MS-SQL Server, Netezza databases.
Languages: SQL, PL/SQL, C++, UNIX Shell scripting
DW/ETL Concepts: IBM InfoSphere DataStage Suite, Ascential DataStage 8.0.1/7.5.2/7.1/7.0 Parallel Extender (PE), Information Analyzer, Metastage 6.0, Cognos 6.0/5.x, Data Mining,Data mart, OLAP and OLTP.
Data Modelling: Data Modeling, Star Schema Modeling, Snow-Flake Modeling, FACT and Dimensions, Physical and Logical Data Modeling, Erwin 3.5.2/3.X
PROFESSIONAL EXPERIENCE
Confidential, Medina, OH
ETL/ DataStage Developer
Responsibilities:
- Designed the ETL jobs using IBM infoSphere Information Server from 11.5/8.5 to Extract, Transform and load the data into Staging and then into Oracle Database.
- Experience working with file transfer methods (e.g. FTP).
- Involved in testing different interfaces and web application for facets.
- Proven experience working in data management, configuration and logical data modeling.
- Proficiency in understanding relational database concepts.
- Designed IBM infosphere parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Tested the changes for the front end screen in facets related to following module, test the facets batches (membership).
- Written SQL/ Netezza queries to validate data, as per business requirements.
- Extensively worked with Shared Containers for Re-using the Business functionality.
- Extensively used shared containers to capture row count.
- Created few jobs for validating the data.
- Used Control-M scheduler.
- Worked in upgrading the Datastage version from 8.5 to 11.5.
- Performed the Integration and System testing on the ETL application.
- Worked in backing up of the jobs using IBM infosphere Data Stage Manager Export/Import utility.
- Working with BI team to apply the business rules.
Environment: IBM Infosphere DataStage 11.5/8.5 (Designer, Director), Netezza, Oracle 10g, SQL Server, DB2, Flat files.
Confidential, O’Fallon MO
Sr. ETL/ DataStage Developer
Responsibilities:
- Designed the ETL jobs using IBM infoSphere Information Server 8.7 to Extract, Transform and load the data into Staging and then into Oracle Database.
- Extensively used the designer to develop various parallel jobs to extract, transform, integrate and load the data into Corporate Data warehouse (CDW).
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Change Data Capture (CDC), Modify, Row Generator and Aggregator.
- Transformed data from one server to other server using tools like SQL Server Integration Services (SSIS).
- Experience in using SSIS tools like Import and Export package Installation and SSIS Package Designer.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Extensively worked with Shared Containers for Re-using the Business functionality.
- Created Job Parameters and Environment variables to run the same job for different schemas.
- Used to do code reviews for all the jobs before moving to production.
- Used ETL Job Control to run the jobs.
- Extensively used shared containers to capture row count.
- Used Director to monitoring jobs, run and validating its components.
- Involved in the Performance Tuning of the DataStage jobs using different methodologies like configurations of the environment variable file, designing and editing configurations, increasing the reading as well as the writing speed while fetching or loading data to files or databases.
- Migrated projects from development to QA to Production environments.
- Performed the Integration and System testing on the ETL application.
- Taking the regular backups of the jobs using Data Stage Manager Export/Import utility.
- Working with BI team to apply the business rules for OLAP, designing the Frame Work models.
- Assisted operation support team for transactional data loads in developing SQL & Unix scripts.
- Participated in Unit testing and Integration testing of DataStage jobs.
Environment: IBM Infosphere DataStage 8.7 (Designer, Director and Administrator), IBM AIX UNIX, Oracle 10g, SQL Server, DB2, Flat files.
Confidential, San Antonio, TX
DataStage developer
Responsibilities:
- Designed the ETL jobs using IBM infoSphere Information Server 8.5 to Extract, Transform and load the data into Staging and then into Oracle Database.
- Extensively used the designer to develop various parallel jobs to extract, transform, integrate and load the data into Corporate Data Warehouse (CDW).
- Designed and developed the ETL jobs using Parallel Edition which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Designed parallel jobs using stages such as Join, Merge, Lookup, Remove Duplicates, Copy, Filter, Funnel, Dataset, Lookup, Pivot, and Sort, Surrogate key Generator, Modify and Aggregator.
- Handled Performance Tuning of Jobs to ensure faster Data Loads.
- Created Master controlling sequencer jobs using DataStage Job Sequences.
- Extensively worked with Shared Containers for Re-using the Business functionality.
- Extensively developed and deployed UNIX Shell scripts that provide parameter values to DataStage jobs during runtime.
- Created Parameter Sets and Value Files to run the same job for different schemas.
- Worked with Slowly Changing dimension stage for implementing slowly changing dimensions.
- Used Director to monitor jobs, run and validating its components.
- Provided data mappings (extract, transform and load analysis) of the data marts and data feed systems in the aggregation efforts.
- Migrated projects through DSX executable files from Development to Test environment and Production environments.
- Performed the Integration and System testing on the ETL modules.
- Worked with BI team to apply the business rules for online analytical processing (OLAP), and for designing the Frame Work models.
- Assisted operation support team for transactional data loads in developing SQL & Unix scripts
- Participated in Unit testing and Integration testing of DataStage jobs.
- Scheduled jobs using AutoSys job scheduler utility based on the control flow and monitored the production jobs closely for any possible errors.
Environment: Flat files, Autosys. DataStage 8.5 (Designer, Manager, Director), Oracle 9i, TOAD, SQL/PLSQL, Teradata, Erwin 4.0, UNIX (AIX) and Windows NT/2000.
Confidential, Seattle, WA.
DataStage Developer
Responsibilities:
- Identified data elements to create source to target mapping documents.
- Involved inExtraction, Transformation and Loadingof the source data to the target Data Warehouse database.
- Worked very closely with analysts to determine data requirements and identify data sources, provide estimates for task duration.
- Used theWebSphere DataStage Designerto develop processes forextracting, cleansing, transforming, integrating and loadingdata into data warehouse database.
- Extracted sources from Databases (including Oracle 10g and Netezza) to transform and load into the Target Database.
- Extracted various XML Source files to load them into the Target database.
- UsedDataStage Designerfor importing metadata from repository, new job categories and creating new data elements.
- Effectively usedDataStage Designerto Import/Export projects from development server to production server. Parameterized jobs for changing environments.
- Worked extensively ondifferent types of stageslikeSequential file, Data Set, Aggregator, Transformer, Merge, Join, Lookup, Sort, Remove Duplicates, Funnel and Containersfor developing jobs.
- UsedParallel Extender for parallel processingfor improving performance when extracting the data from the sources.
- Involved in resolving various production issues on priority basis.
Environment: IBM Information Server/ infoSphere DataStage 8.0.1(DataStage and QualityStage), Tools & Utilities, Netezza, UNIX, Oracle 10g, TOAD, Windows 2003/XP, SHELL SCRIPTING.
Confidential, Santa Clara, CA
DataStage Developer
Responsibilities:
- Worked on mappings between multiple sources and Oracle target tables in Oracle Applications based on the functional design document (FDD).
- Interpret mapping documentation, and translate into detailed design specifications.
- Coordinated with System architects to design comprehensive solution.
- Worked on multiple sources: Oracle, SQL Server, Sequential files, XML files.
- Developed parallel jobs using most of the stages like Sequential file, change capture, funnel, transformer, data set, ODBC stage etc.
- Assist in defining a test plan for functional/quality assurance testing.
- Performed Unit Testing and System/Integration testing. Design and documented the test cases for unit testing.
- Extensively used Data Transforms like Data type Conversion, Null Handling, Date & timestamp.
- Used DataStage to extract, transform and load data from different input sources like flat files, Oracle into Oracle database.
- Estimating the efforts for Projects, Change Requests and Enhancements from a technical standpoint.
- Designed DataStage job with CDC (change data capture) to capture all records and transfer file over to the customer on a daily basis.
- Used AUTOSYS job scheduler to schedule all batch jobs and their dependencies to automate the process.
- Created job sequences to implement job dependencies.
- Generated Surrogate Keys for composite attributes while loading the data into Target tables.
- Written wrapper shell scripts to run data stage jobs and scheduling the jobs.
Environment: IBM InfoSphere Information Server 8.0.1, Metadata Workbench 8.5, Oracle11g, TOAD.
Confidential
Software Developer
Responsibilities:
- Installed the database and overseen the implementation of users and security of the application's database.
- Set the environment of the client database using the export files.
- Involved in requirement analysis and design documentation.
- Designed and developed database tables necessary along with the necessary constraints to enforce business rules.
- Created and modified several Database Triggers, especially for Audit and Security purposes.
- Developed the Stored Procedures, Packages, and Cursors and involved in the performance tuning of PL/SQL.
- Developed packages for various financial analyses.
- Resolved several hardcore business rules and issues at the client site.
- Developed several Complex Reports for various requirements using to increase the functionality and usability.
- Involved in production support for the client.
- Corrected the major form bugs, errors through client and end user interaction.
Environment: Oracle8, PL/SQL, Oracle Import, Developer 2000.