Etl Developer Resume
Newark, NJ
SUMMARY
- Highly Motivated, Solutions Driven with over 7 years of Data Warehousing experience in the areas of ETL design and Development.
- Involved in complete Software Development life - cycle (SDLC) of various projects, including Requirements gathering, System Designing, Data modeling, and ETL design, development, Production Enhancements, Support and Maintenance.
- Excellent Interpersonal and communication skills with an ability to remain highly focused and self-assured in fast-paced and high-pressure environments.
- Extensive ETL tool experience using IBM Infosphere/Websphere DataStage, Ascential DataStage.
- Worked on DataStage tools like DataStage Designer, DataStage Director and DataStage Administrator.
- Strong understanding of the principles of Data Warehousing using fact tables, dimension tables and star/snowflake schema modeling.
- Worked extensively with Dimensional modeling, Data migration, Data cleansing, ETL Processes for data warehouses.
- Used Enterprise Edition/Parallel stages like Datasets, Sort, Join, Lookup, Change Data Capture, Funnel, Row Generator and many other stages in accomplishing the ETL Coding
- Familiar in using highly scalable parallel processing infrastructure using parallel jobs and multiple node configuration files.
- Experienced in scheduling Sequence and parallel jobs using DataStage Director, UNIX scripts and scheduling tools.
- Experience in troubleshooting of jobs and addressing production issues like data issues, ENV issues, performance tuning and enhancements.
- Knowledge in using Erwin as leading Data modeling tool for logical (LDM) and physical data model (PDM).
- Extensive experience in design and development of Decision Support Systems (DSS).
- Assisted in development efforts for Data marts and Reporting.
- Technical and analytical skills with clear understanding of design goals of ER modeling for OLTP and dimension modeling for OLAP.
- Extensive experience in Unit Testing, Functional Testing, System Testing, Integration Testing, Regression Testing, User Acceptance Testing (UAT) and Performance Testing.
- Worked with various databases like Oracle 10g/9i/8i, DB2, SQL Server, Teradata.
TECHNICAL SKILLS
ETL Tools: IBM Infosphere DataStage 8.5, IBM Websphere DataStage 8.0.1 (Designer, Director, Administrator), Ascential DataStage 7.5.2 (Designer, Director, Administrator, Manager).
Database: Oracle 10g/9i/8i, IBM DB2/UDB, Teradata, SQL Server 2003/2005/2008.
Data Warehousing: Star & Snow-Flake schema Modeling, Fact and Dimensions, Physical and Logical Data Modeling, ERwin. Business Objects XI R2, Cognos, ReportNet
Operating systems: Windows 7x/NT/XP, UNIX, LINUX, Solaris, MS-DOS
Languages/Scripting: C, C++, Java, D2K, Visual Basic, PL/SQL, UNIX Shell scripts
Testing/Defect Tracking: HP Quality Center, Test Director, Bugzilla, Jira
PROFESSIONAL EXPERIENCE
Confidential, Austin, TX
Sr. ETL Datastage Developer
Responsibilities:
- Analyzed, designed, developed, implemented and maintained Parallel jobs using IBM info sphere Data stage.
- Involved in design of dimensional data model - Star schema and Snow Flake Schema
- Generating DB scripts from Data modeling tool and Creation of physical tables in DB.
- Worked SCDs to populate Type I and Type II slowly changing dimension tables from several operational source files
- Created some routines (Before-After, Transform function) used across the project.
- Experienced in PX file stages that include Complex Flat File stage, DataSet stage, LookUp File Stage, Sequential file stage.
- Implemented Shared container for multiple jobs and Local containers for same job as per requirements.
- Adept knowledge and experience in mapping source to target data using IBM Data Stage 8.x
- Implemented multi-node declaration using configuration files (APT Config file) for performance enhancement.
- Debug, test and fix the transformation logic applied in the parallel jobs
- Involved in creating UNIX shell scripts for database connectivity and executing queries in parallel job execution.
- Used the ETL Data Stage Director to schedule and running the jobs, testing and debugging its components & monitoring performance statistics.
- Experienced in using SQL *Loader and import utility in TOAD to populate tables in the data warehouse.
- Successfully implemented pipeline and partitioning parallelism techniques and ensured load balancing of data.
- Deployed different partitioning methods like Hash by column, Round Robin, Entire, Modulus, and Range for bulk data loading and for performance boost.
- Repartitioned job flow by determining DataStage PX best available resource consumption.
- Created Universes and reports in Business object Designer.
- Created, implemented, modified and maintained the business simple to complex reports using Business objects reporting module.
Environment: IBM Info sphere DataStage 8.5, Oracle 11g, Flat files, Autosys, UNIX, Erwin, TOAD, MS SQL Server database, XML files, MS Access database.
Confidential, Milwaukee, WI
ETL Datastage Developer
Responsibilities:
- Designed the ETL jobs using IBM Web Sphere Data stage 8.0.1 to Extract, Transform and load the data into Staging and then into Target Database.
- Designed and developed the ETL jobs using Parallel edition which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Handled Performance Tuning of Jobs to ensure faster Data Loads
- Created Master controlling sequencer jobs using DataStage Job Sequencer.
- Extensively worked with Shared Containers for Re-using the Business functionality.
- Extensively developed and deployed UNIX Shell scripts as wrappers that provide values to DataStage jobs during runtime.
- Created Job Parameters and Environment variables to run the same job for different sources and targets .
- Used Director to monitor jobs, run and validating its components.
- Extensive worked with jobs export, jobs import and multi job compilation etc.
- Provided data models and dataflow (extract, transform and load analysis) of the data marts and feeder/target systems in the aggregation effort.
- Migrated projects from development to QA to Production environments
- Performed the Integration and System testing on the ETL processes.
- Taking the regular backups of the jobs using Data Stage Export/Import utility.
- Working with BI team to apply the business rules for OLAP, designing the Frame Work models.
- Assisted operation support team for transactional data loads in developing SQL & Unix scripts
- Scheduled jobs using Autosys job scheduler utility based on the requirements and monitored the production closely for any possible errors.
Environment: IBM InfoSphere DataStage 8.5, IBM Websphere DataStage 8.0.1, IBM AIX 5.2, Oracle 10g, XML files, Autosys, MS SQL Server database, sequential flat files, TOAD.
Confidential, Newark, NJ
ETL Developer
Responsibilities:
- Worked with the Business analysts and the DBAs for requirements gathering, analysis, testing, and metrics and project coordination.
- Successfully handled the slowly changing dimensions.
- Involved in the Dimensional modeling of the Data warehouse.
- Developed documents like Source to Target mapping for developing the ETL jobs.
- Worked with DataStage server stages like OCI, ODBC, Transformer, Hash file, Sequential file, Aggregator, Sort, Merge, and other stages.
- Imported the required Metadata from heterogeneous sources at the project level.
- Involved in designing various jobs using PX.
- Extensive worked with jobs export, jobs import and multi job compilation etc.
- Developed Parallel jobs using Parallel stages like: Merge, Join, Lookup, Transformer (Parallel), Teradata Enterprise Stage, Funnel, Dataset, Oracle Enterprise Stage.
- Performed debugging on these jobs using Peek stage by outputting the data to Job Log or a stage.
- Used Remove Duplicates stage to remove the duplicates in the data.
- Involved in the migration of DataStage jobs from Development to Production environment.
- Worked on implementation job performance tuning techniques.
- Designed and implemented several wrappers to execute the DataStage jobs, create job reports out of the DataStage job execution results from shell scripts.
- Designed and implemented wrappers to execute the DataStage jobs from remote servers.
- Worked on database connections, SQL joins, views, aggregate conditions, parsing of objects and hierarchies.
- Tuned SQL queries for better performance for processing business logic in the database.
Environment: Ascential DataStage 7.5.2 (Designer, Manager, Director, Administrator), Oracle 9i, TOAD, SQL/PLSQL, Teradata, Erwin 4.0, UNIX (AIX).
Confidential, Strongsville, OH
ETL Datastage Developer
Responsibilities:
- Designed the ETL jobs using IBM WebSphere Information Server 8.1.
- Designed and developed the ETL jobs using Parallel Extender which distributed the incoming data concurrently across all the processors, to achieve the best performance.
- Handled Performance Tuning of Jobs to ensure faster Data Loads
- Extensively developed and deployed UNIX Shell scripts as wrappers that provide values to DataStage jobs during runtime.
- Worked with Slowly Changing dimension stage for implementing slowly changing dimensions.
- Used Director to monitoring jobs, run and validating its components.
- Developed Parallel jobs using Parallel stages like: Merge, Join, Lookup, Transformer (Parallel), Teradata Enterprise Stage, Funnel, Dataset, Oracle Enterprise Stage.
- Imeplmented various job performance tuning techniques to improve job performace
- Extensive worked with jobs export, jobs import and multi job compilation etc.
- Migrated projects from development to QA and Production environments.
- Performed the Integration and System testing on the ETL application.
- Taking the regular backups of the jobs using Data Stage Manager Export/Import utility.
- Assisted operation support team for transactional data loads in developing SQL & Unix scripts
- Participated in Unit testing and Integration testing of DataStage jobs.
- Scheduled jobs using Autosys job scheduler utility based on the requirements and monitored the production closely for any possible errors.
Environment: IBM Infosphere DataStage 8.1 (Parallel & Server), IBM AIX 5.2, Oracle 10g, Flat files, Autosys, Clearcase, TOAD.
Confidential, Deerfield, IL
Datastage Developer
Responsibilities:
- Extensively used DataStage Designer to develop processes for extracting, transforming, integrating and loading data from various sources into the Data Warehouse database.
- Developed several complex ETL jobs for Historical data loads and ongoing data loads using various active and passive stages.
- Implemented job control execution in sequence jobs to execute multiple parallel jobs with key functionality of job controls and job failure handlings.
- Extensively used SQL queries to retrieve data from various database tables and used those queries in the jobs to reduce the complexity of processing data.
- Upgraded to IBM Websphere DataStage 8.0.1 from Ascential DataStage 7.5.2.
- Involved in performance tuning of the jobs while developing the jobs.
- Used Datastage Manager for importing metadata from repository, new job categories and creating new data elements.
- Extensively worked with various restructure stages like column import, column export stages etc.
- Created the various complex jobs to load the data from staging area to the application data base and from there to data mart.
- Used Parallel Extender for splitting the data into subsets and to load data, utilized the available processors to achieve job performance, configuration management of system resources in Orchestrate environment.
- Executed various test case scripts and data validations scripts
- Developed batches and sequencers in designer to run and control set of jobs.
- Involved in importing and exporting jobs category wise and maintaining the backup regularly.
- Performed the Unit testing for jobs developed to ensure that it meets the requirements.
- Developed UNIX shell scripts to automate file manipulation and data loading procedures.
Environment: IBM WebSphere DataStage 8.0.1 (Designer, Director, Administrator), Ascential Datastage 7.5.2 (Director, Designer, Manager, Administrator), DB2, Teradata, SQL Server, Oracle 9i, Clear Case.
Confidential
DW/ETL Tester
Responsibilities:
- Designed and created various Test Plans, Test Cases based on the Business requirements.
- Involved in extensive DATA validation using SQL queries and back-end testing
- Used SQL for Querying the DB2 database in UNIX environment
- Developed test reports and participated in testing prioritization and archived test results and user signoff
- Involve with Design and Development team to implement the requirements.
- Performed execution of test cases manually to verify the expected results.
- Used Mercury Quality Center 9.0 to state requirements, business components, test cases, test runs for every iterations, defects. Also link defects with requirements
- Performed all aspects of verification, validation including functional, structural, regression, load and system testing.
- Worked with other members of the QA and Development teams and offshore team (INDIA) in improving the processes, tools, methods, effectiveness and efficiency
- Used TOAD Software for Querying Database.
- Participated in walkthrough and defect report meetings periodically.
- Involved in Unit, Functional, Regression and System testing.
- Documented and reported various bugs during Manual Testing.
- Stored data in the Data Warehouse about Customers, Accounts. The loads were scheduled (Daily and weekly) depending on the frequency of source data.
- Written SQL Queries to define, Identify and validate the code written for the data movement into the database tables.
- Retesting the resolved defects/issues after the developers fix the defects
- Performed Installation testing and Performance testing manually.
Environment: ETL (Data stage), UNIX, Test Director, DB2, Oracle, Autosys, Windows XP, Toad