Datastage Developer Resume
Des Moines, IA
SUMMARY
- 7 years of IT experience specializing in Analysis, Design and Development of ETL processes in all phases of the Data Warehousing life cycle with expertise in Decision Support Systems and OLAP technology.
- Extensive experience in DataStage 8.1/7.5/6.x using DataStage Designer, DataStage Director, DataStage Administrator, QualityStage and Parallel Extender.
- Expert in writing, testing and implementing procedures, functions and triggers at database level using PL/SQL.
- Strong in writing UNIX shell scripts.
- Efficient in all phases of the development lifecycle, coherent with data cleansing, data conversion, performance tuning and system testing.
- Expertise in developing and maintaining overall Test Methodology and Strategy, Documenting Test Plans, Test Cases and editing, executing Test Cases and Test Scripts.
- Strong knowledge of both Star and Snow Flake Schema.
- Efficient in in corporation of various data sources such as Oracle, MS SQL Server, XML and Flat files into the staging area.
- Involved in coordinating calls with source systems, clients and with other business partners.
- Good experience is maintaining the versioning and deployment instruction documents while moving the code to various environments.
- Extensively worked with production support team to resolve issues and worked in extracting data from Oracle, XML and flat files.
- Experienced in troubleshooting, performance tuning and performance monitoring for enhancement of DataStage jobs.
- Hands on Experience in SQL, UNIX, Shell Scripting, Data Relationships, Data extraction & Validation.
- Good exposure to development, testing, debugging, implementation, documentation, user training & production support.
- Extensively used SCCS and SVN for version control.
- Resolving data issues, complete unit testing and complete system documentation for ETL processes
- Experience on Creating detailed project outlines and application design specifications.
- Involved extensively in GUI, System and Regression Testing.
- Good at handling TWS and Autosys handler.
- Experienced in IBM Quality Stage using Investigate Stage, Standardize Stage, Match and Survive stages.
- Proven track record in troubleshooting of DataStage jobs and addressing production issues excellent knowledge of studying the data dependencies using Metadata stored in the DataStage Repository.
- Experience on SSIS tool to do ETL operations.
- Strong understanding of business processes with excellent writing and documentation skills for the management and development.
- Excellent track record as a leader and team player with effective communication skills.
- Substantial ability to employ knowledge based solutions for industry tribulations.
TECHNICAL SKILLS
ETL Tools: InfoSphere DataStage v8.x/7.5.3/7.x, 6.x, Designer, Director, Manager and Administrator, QualityStage, Informatica.
Data Modeling: Dimensional Data Modeling, Star Schema, Snow - Flake Schema, Physical and Logical Data Modeling.
Operating Systems: Windows 95/98/NT/2000/XP, Unix, AIX, Linux
Database: DB2, Oracle 11g/10g/9i/ 8i/8.x/, SQL Server, Vertica, PL/SQL
Languages: SQL, PL/SQL, UNIX Shell Scripting.
Tools: TOAD, SQL PLUS, Autosys, and SQL Navigator, Rapid SQL.
Application Suite: MS Office, MS Project, MS Share Point, Microsoft Visio Professional 5.0/2002.
Testing Tools: HP Quality Center, WinRunner, LoadRunner and Silk Test.
PROFESSIONAL EXPERIENCE
Confidential, Des Moines IA
DataStage Developer
Environment: DataStage v8.1.2, Oracle 11g, UNIX, Autosys, HP Quality Center.
Responsibilities:
- Involved in Business Analysis and Requirements Collection.
- Implemented the Functional Database Design based on the business needs and data profiling analysis.
- Involved for the Dimensional Data Modeling and populating the business rules using mappings into the Repository for Meta Data management.
- Involved in the creation of mapping documents from data warehouse to data marts.
- Responsible for designing and developing ETL jobs using websphere datastage.
- Developed ETL best practices document to maintain same standards all over the project.
- Worked with different sources such as flat files, sequential files, and oracle database.
- Worked with Parallel stages like Join, Lookup, Merge, Change Capture, Remove Duplicates, Aggregator, Filter, and Funnel and debug stages like Peek, Column generator and Row generator.
- Implemented slowly changing dimensions.
- Worked with Real Time stages like XML Input, XML Output and XML Transformer.
- Creating project level parameters and reusable components like shared containers.
- Extensively worked with Job Sequence such as User Variable Activity, Notification Activity, Routine Activity, Terminator Activity, etc
- Developed DS Routines for Job Auditing.
- Used quality stage for name standardization and for address survivorships.
- Involved in creating PL/SQL Stored Procedures, Packages, and Triggers for the application, tuning SQL queries and the database
- Creating the JILs and automating the Autosys jobs as per project needs and scheduled the DataStage ETL batch jobs on a daily, weekly and monthly basis.
- Developed Shell Scripts for event automation and scheduling.
- Prepared Unit test cases and UAT test cases to ensure the data quality.
- Interacted with User Groups/ Corporate Testing Groups/Business people.
- Tuned ETL jobs for performance improvement.
- Implemented change data capture (CDC) logic.
- Tuned oracle SQL queries by creating indexes on the key column for better performance.
Confidential, NY
DataStage Developer
Environment: Data stage 8.0, Parallel Extender, Oracle 10g/9i, SQL*Plus, Oracle SQL*Loader, SQL, PL/SQL, Autosys, HP Quality Center.
Responsibilities:
- Participated in all phases of project cycle including Requirement Analysis, Client Interaction, Design, Coding, Testing, support and Documentation.
- Used DataStage Designer to develop jobs for extracting, cleansing, transforming, integrating, and loading data into data warehouse database.
- Developed parallel jobs for extracting, transforming and loading data to targets making use of various stages like Sequential File, Join Stage, Merge Stage, Aggregator, Surrogate Key Generator and Transformer.
- Made use of Parallel Job Extender for parallel processing in various instances so as to reduce the complexities and expediting the processes.
- Developed various transformations in the process of building Data warehouse database.
- Utilized slowly changing Dimensions for tracking changes to dimensions over time.
- Developed Jobs to load Dimension tables and Fact tables in EDW.
- Developed shared containers for code reusability.
- Developed Unix shell scripts to automate the Data Load processes to the target Data warehouse
- Designed complex DataStage Jobs, sequences and tuned them for better performance.
- Responsible for preparing ETL Documentation for the developed processes.
- Extract data from different source systems (Mainframes, Flat files, Oracle DB) move them into the warehouse and mart.
- Studied the existing data sources with a view to know whether they support the required reporting and generated change data capture request.
- Involved in designing batch jobs for updating some base objects.
- Responsible for handling Production Support tickets and creating Autosys scheduling jobs and to monitor the PROD runs.
- Schedule the jobs developed as per the time mentioned by the business users and monitor the jobs and fix the issue if there is any job failure.
- Write oracle Pl/Sql scripts, and UNIX shell scripts as per the functional and technical specs provided by the client.
- Developed UNIX Scripts to purge the data file on Production servers to free the disk space.
- Responsible for preparing Unit test cases and validating the data as per the business rules.
- Involved in tuning many Oracle scripts and other ETL processes used in this project.
- Involved in preparing Integration test cases, UAT test cases and involved in UAT testing.
- Co-ordinate the offshore dev and testing teams.
- Involved in conducting knowledge sharing sessions for end-users, to business and to offshore team members
Confidential, Emeryville, CA
DataStage Developer
Environment: Data stage 7.5 PX, UNIX, SQLServer2000, Oracle 9i, and Control M 6.2
Responsibilities:
- Involved in designing Star Schema dimensional model for the project.
- Performed change control for all Data stage jobs Monitor the Data-stage engine. Tuned and configured for better performance Move the Datastage jobs as per service Request from one environment to other environment.
- Involved in ETL Architecture and Design proposed for entire implementation.
- Identified high-level components for the ETL program. Designed ETL components for Change requests
- Developed parallel, server jobs and job sequences using DataStage PX.
- Developed job specifications and unit test cases for parallel jobs and job sequences.
- Worked with Lookup, Merge, Joiner, Funnel, Transformer, MQ Stages, External procedure Stage etc.
- UsedParallelExtenderfordistributingloadamongdifferentprocessorsbyImplementingpipelineandPartitioningofdatainparallelextender.
- Involved in creating, administering repositories, Folders, Permissions
- Migrating the code from Dev to Test and Test to Production
- Installed patches, Datastage stages and upgrade to latest version Backup of Datastage project and restore as and when required created the Metastage repository and users.
- Identified high-level components for the ETL program. Designed ETL components for Change requests
- Utilized the stages of Job Sequence such as User Variable Activity, Notification Activity, Routine Activity, Terminator Activity, etc.
- Developed various shared containers, stages and validated and fine-tuned the ETL logic coded into jobs.
- Responsible for validating the Datastage jobs, sequences against the pre-defined ETL design standards.
- Analyzed the requirements to identify the necessary tables that need to be populated into the staging database.
- Prepared the DDL’s for the staging/work tables and coordinated with DBA for creating the development environment and data models.
- Involved in analyzing the source data coming from different Data sources such as oracle, flat files etc.
- Involved in defining technical and functional specifications for ETL process.
- Used both pipeline and partition parallelism for improving performance.
- Extensively used third party tool which is Control M for job scheduling.
- Used lookup stage with reference to oracle tables for insert/update strategy and updating of slowly changing dimensions.
- Involved in performance tuning of the ETL process and performed the data warehouse testing.
Confidential
ETL DataStage Developer
Environment: Data stage 7.5 PX, SQLServer2000, Oracle 8i, SQL, PL/SQL, CDC, UNIX shell scripting.
Responsibilities:
- Project Management and coordinating development activities with external clients.
- Extensively worked with Datastage - Manager, Designer, Director and Administrator to load data from flat files, legacy data, to target Oracle database
- Used Datastage for subjecting the data to multiple stages, thereby transforming it and prepared documentation by doing data analysis.
- Designed and developed jobs and tuned existing jobs for better performance using DataStage and Orchestrate.
- Wrote several triggers to support the extraction process in the back-end using Oracle PL/SQL.
- Created DataStage jobs, batches and job sequences and tuned then for better performance.
- Performed data loading with multiple and parallel ETL processes.
- Developed Shell Scripts for event automation and scheduling.
- Understanding business needs and implementing the same into a functional database design.
- Maintaining warehouse metadata, naming standards and warehouse standards for future application development.
- Tuned oracle SQL queries by creating indexes on the key column.
- Involved in creating PL/SQL Stored Procedures, Packages, and Triggers for the application, tuning SQL queries and the database
- Implemented the Functional Database Design based on the business needs.
- Responsible for the Dimensional Data Modeling and populating the business rules using mappings into the Repository for Meta Data management.
- Defined the ETL strategy for data population from Oracle databases.
- Responsible for UNIT, System and Integration testing. Developed Test scripts, Test plan and Test Data.
- Identifying areas for improvements and implementing the same
- Design Business Mapping sheet and Test Case scenario docs
- Implemented change data capture (CDC) logic to check for changes in data.
- Guided Onsite and Offshore project team on technical issue resolution.
Confidential
ETL Developer
Environment: Datastage, Oracle 8i, Windows 2000, SQL, PL/SQL Toad 7.0, UNIX.
Responsibilities:
- Gathering detailed requirements and designing a High Level design by using Visio.
- Implemented and wrote ETL programs to load Intermediary dimension tables.
- Coding and testing ETL jobs and UNIX scripts.
- Worked on Complex ETL programs to implement multilevel hierarchy in dimensional tables.
- Extensively worked with Manager, Designer, Director to define the Source and Target objects. Extracted Source Data using Datastage tools and stored procedure
- Worked Extensively on Datastage Director for scheduling the Job to run, emailing production support for troubleshooting from LOG Files.
- Developed SQL, PL/SQL Stored procedures, triggers and cursors for implementing Complex business rules and transformations.
- Developed UNIX scripts using K-Shell, SED & UNIX commands to perform net change (incremental data). Developed UNIX Scripts to develop Driver programs to run Datastage Job Control Sequence. Used DS commands to run the Datastage Job Control Sequence.
- Generated completion messages, status reports using capabilities Job Control Sequence and UNIX.
- Responsible for UNIT, System and Integration testing. Developed Test scripts, Test plan and Test Data. Participated in UAT (User Acceptance Testing).
- Involved in the Analysis of Physical Data Model for ETL mapping and the process flow diagrams.
- Created and maintained metadata and ETL documentation that supported business rules and detailed source to target data mappings.
- Designed and wrote data extraction, transformation, clean and load.
- Used different stages such as the Aggregators, lookups, Joins, Filters.etc.
- Created and maintained data governance strategies that reduced data redundancy wherever possible.
- Investigated and assisted users in the resolution of any day-to-day data warehouse issues.