Sr Etl Developer Resume
Hoboken-new, JerseY
SUMMARY:
- Over 8 years of IT experience in implementation of Data Warehouse, ETL and MDM applications using Informatica MDM, IDD, IDQ, Power Center and Power Exchange tools.
- Experience in design, development and implementation of the Extract, Transform and Load ETL and Extract Load and Transform (ELT) processes and development of transformation standards and procedures using Informatica best practices.
- Extensively worked on Teradata BTEQ scripts, Utilities (Fast Export, Fast Load, Multi Load, TPump), Teradata Parallel Transporter, and SQL Assistant.
- Experience in configuration of Informatica Data Director (IDD) to meet data governance objectives in various project life cycles and MDM hierarchies' development.
- Worked extensively on (IDQ)Informatica Data Quality Versions 9.x.
- Extensive experience in developing complex mappings from varied transformations like Router, Filter, Sorter, Connected and Unconnected lookups, Normalizer, Expression, Aggregator, Joiner, Union, Update Strategy, Stored Procedure and Sequence Generator
- Expertise in design and implementation of Slowly Changing Dimensions (SCD) type1, type2, type3.
- Expertise on optimizing and tuning the Netezza SQLs to improve the performance.
- Experience in loading data, troubleshooting, Debugging mappings, performance tuning of Informatica (Sources, Targets, Mappings and Sessions) and fine - tuned transformations to make them more efficient in terms of session performance.
- Database experience using Oracle MS SQL Server 2000 and MS Access Netezza.
- Strong experience using SQL, PL/SQL Procedures/Functions, Triggers and Packages.
- Experience in UNIX Operating System and Shell scripting.
- Working knowledge of data warehouse techniques and practices, experience including ETL processes, dimensional data modeling (Star Schema, Snow Flake Schema, FACT & Dimension Tables), OLTP and OLAP.
- Experience in all data processing phases, from the Enterprise Data Model, Logical and Physical Model, Dimensional Modeling, designing and modeling Star & Snow flake schemas, Data Warehousing (ETL)
- Extensive experience using database tool such as SQL Developer, Win SQLSQL *Plus, SQL * Loader.
- Good knowledge of business intelligence tool Business Objects .
- Extensively worked on UNIX shell scripting to implement different business logic.
- Excellent Knowledge and Understanding along with some hands-on experience on various third parties Scheduling Tools like, Autosys, Control-M,TIDAL
- Expertise in SQL/PLSQL programming development and execution of Stored Procedures, Functions and Packages. Experience in Loader utilities including SQL Loader and Teradata TPump, Fast Load, Multiload.
- Knowledge, Understanding and Some hands-on Experience in different Business Intelligence Tools like Business Objects, Cognos.
TECHNICAL SKILLS:
Data Warehousing/ ETL: MDM 10.x/9.x, Informatica Power Center 8.x, 7.x, Informatica Data Quality(IDQ),IDE, ETL, Data Mining, DTS, SQL*Plus, SQL*Loader, Business Objects Data Integrator, Data Cleansing, Informatica Data Quality and Metadata Manager
Tools: Informatica IDQ, First Logic (Data Right 2.55c, DocuRight 4.20c, ACE 7.11c, Match/Consolidate 7.10c), First Logic IQ Suite /Link, RAPID, First Logic IQ8 Integration Studio and PERL
Data Modeling: Data Modeling, Star Schema, Snow-Flake, Dimensional Data Modeling, FACT and Dimensions Tables, Physical and Logical Data Modeling, Erwin 3.5.2/3.x
Databases: Netezza, Teradata 6.x (Sql Assistant, QueryMan, BTEQ, Multi Load, Fast Load, TPump and Fast Export), Oracle PL/SQL, MS SQL Server 2005, 2008 SSIS, TSQL, Informix, Progress, My Sql, Pervasive SQL
Languages/Scripting: VB, XML, VC++, VBA, JAVA, VB Script, JAVA Script, UML, SQL, T-SQL, Teradata SQL, PL/SQL and Shell Scripting.
UNIX Tools: C Shell, K Shell, Bourne Shell, Perl, AWK, VI, SED.
Scheduling Tools: AppWorx, ROC Maestro, Autosys, Crontab, Tidal and Sterling
Environment: Windows NT Sun Solaris, Sun OS, HP-UX, SCO UNIX, Win MS DOS 6.22, Sun Sparc, HP-9000, IBM-PC Compatibles
PROFESSIONAL EXPERIENCE:
Confidential, Hoboken-New Jersey
Sr ETL Developer
Responsibilities:
- Created ETL design/specification/Mapping documents for the data warehouse.
- Analyzing all the business requirements and source system behavior.
- Designing Data Warehouse / Data Model / MDM and Integration architecture solutions for Hornet Data warehouse requirements to develop data Integration solution.
- Created UNIX Shell scripts to perform different tasks like FTP, Calling Informatica Workflows using pmcmd, Email reports to users every day.
- Created column level Profiles and IDE mappings to analyze the data patterns and statistics for each column in the source.
- Closely worked with Data Steward team for designing, documenting and configuring Informatica Data Director for supporting management of MDM data.
- Worked on Shell scripting for data archiving & environment migrations purposes.
- Created Informatica Mappings/Sessions/Workflows required for different processes.
- Fine tuning the existing Informatica Mappings which are delaying the nightly batch cycle by significant amount of time.
- Excelled in creating and implementing procedures in Netezza database.
- Worked on creating multiple Dynamic Target files using Informatica.
- Worked on database tuning methodologies to improve the performance of the database, Parallelizing complex queries, Created Hints, Indexes, partitions to effectively reduce the response time in Production.
- Involved in decommission of MySQL Server 2005 and migrated required data into Teradata (EDWard Enterprise Data warehouse) by creatingnew mappings, Shell scripts and BTEQ scripts.
- Created xml’s for new Control-M jobs to be scheduled in Production.Worked on Unit testing, System Integration testing, QA/UAT to thoroughly test the new platform's functionality.
- Reporting of the errors/defects occurred during the test execution & raising defects/action items in HP quality center throughout the testing phase.
- Conducting Code review/Code walkthrough meetings to better understand the process flow & make sure there are no issues in production post-go Live.
- Performed Data Profiling using Informatica Data Quality Tool.
- Identified and eliminated duplicate datasets and performed Columns, Primary Key, Foreign Key profiling using IDQ.
- Designed IDQ mappings which are used as Mapplets in Power center.
- Preparing the documentations for the platform Go-Live.
- Developed/Tested & migrated the code (UNIX Shell scripts, SQL scripts, XML’s for Control-M) from QA/UAT to Production environments.
- Migrated data from Oracle SQL Developer to IBM Netezza using Informatica, UNIX shell and Perl scripting.
- Migrated all the mappings from oracle warehouse builder using Informatica and Netezza procedures.
- Migrating the Informatica workflows from QA/UAT to Production environment after testing & signing off the process at each & every level.
- Expertise in design and development of the ETL processes, mappings and BTEQ scripts to enable Incremental processing, error and exception handling, data cleansing and validation.
- Responsible for Updating, tuning and maintaining the test environment Post- Deployment.
- Clearly documenting the Pre/Post Deployment & Deployment steps for step by step implementation during production roll out.
Environment: Informatica Power Center 9.5/9.1, HP ALM/Quality Center 12, Netezza, MDM Hub, IDE/IDQ9.6.1, Oracle 11.2.0.2.0 , PL/SQL, UNIX AIX4.2,Teradata, Windows XP/2000, Control-M Enterprise Manager 7.0.00.600
Confidential, Mount Laurel, NJ
ETL/ Informatica Developer
Responsibilities:
- Involved in creating Detail design documentation to describe program development, logic, coding, testing, changes and corrections. Working on complex ETL processes that are highly performance and meet enterprise standards for availability and fault tolerance in a secure environment.
- Developed MDM solution meeting technical specifications and business requirements
- Expertise in creating and implementing procedures in Netezza database.
- Developed BTEQ (Teradata) queries for extracting data from production database and built reports.
- Served as a module lead for 10 different SSB RIMs and managed 2 members under my team.
- Worked with business SMEs on developing the business rules for cleansing. Applied business rules using Informatica Data Quality (IDQ) tool to cleanse data.
- Participating in User meetings, gathering requirements and translating user inputs into Technical Specification documents. Reviews documentation for ETL processes and ensures consistency of implementation
- Extensively involved in writing ETL Specifications for Development and conversion project. Ensures adherence to best practices for ETL development into standard warehouse models and dimensional data structures.
- Integration design sessions by sharing knowledge of data elements from analysis phase to data structure designers and ensure all elements are accounted for through the final design.
- Involved in requirement definition and analysis in support of Data Warehouse. Involve design and development of large-scale data marts or changes to enterprise data warehouses.
- Develops interfaces to enterprise metadata environments and ensures ETL metadata flows into the metadata environment accurately as part of new ETL development.
- Worked extensively on different types of transformations like Source qualifier, expression, Aggregator, Router, filter, update strategy, lookup, sorter,, sequence generator, etc.
- Created Mappings with shared objects/Reusable Transformations/Mapplet using Mapping/Mapplet Parameters/Variables.
- Defined the content, structures and quality of high complex data structures using Informatica Data Explore (IDE).
- Extensively used Informatica client tools Source Analyzer, Target/Warehouse designer, Mapping Designer, Transformation Developer, Informatica Workflow Manager and Informatica Workflow Monitor.
- Created Workflow and Tasks to schedule the loads at required frequency using Workflow Manager.
- Used SQL tools like SQL Developer to run SQL queries and validate the data. Tuning the Mappings for Optimum Performance
- Created Test cases for Unit Test, System Integration Test and UAT to check the data quality. Extensively used HP Quality Center for bug fixing. Assists team members in performance tuning and troubleshooting ETL processes under development.
Environment: Environment: Informatica 9.5, Oracle 11g, 5.3.1, Netezza, IDE, IDQ, Delimited Flat Files, SQL developer, PUTTY, Windows 7, Teradata, PL/SQL, SVN, SharePoint
Confidential, New York, NY
Sr Informatica Developer
Responsibilities:
- Modified and developed the maps/workflows for three data feeds viz ADMARC /TAS (BOSTON GLOBE)/INSI.
- Documented the transformation specs via excel sheets and also through SQL queries.
- Reverse engineered and corrected the existing functional specs.
- Used INFORMATICA to extract data from flat files/mainframes and load it into relational target (ORACLE)/FLAT FILES.
- Responsible to tune ETL procedures and star schemas to optimize load and query performance.
- Developed sessions using Power Center, Workflow Manager for loading the data into target database
- Worked on Performance Tuning of various mappings.
- Developed sessions using Workflow Manager for loading the data into target Database / flat files.
- Developed stored procedures and implemented them in the Maps using the Stored Procedure Transformation.
- Wrote UNIX and PL/SQL scripts for pre & post session processes and to automate daily loads
- Developed source and target reconciliation queries to test and validate the maps in order to ensure the maps function in accordance to the business rules.
- Utilized Selenium IDE automation tool, created build.xml files, html wrapper scripts and used Ant to run the automated tests as a full suite.
- Analyzed huge data sets with billions of data on claims, members and providers (health care data) to measure the exact impact ofinaccurate data.
- Expertise in execution, maintenance & debugging of the scripts with Teradata BTEQ.
- Developed new BTEQ scripts and Informatica mappings to fix the impacted data.
- Prepared high level design document from initial stage for each assigned RIM (Request Information Management), presented it to business users and successfully got approved for those RIMs.
- Accomplished in System Integrated Testing (SIT),Regression Testing (RT) for the assigned projectsthroughWLM (Work Load Manager) tool with zero defects.
- Informatica MDM and ETL Admin activities including security and code promotion.
- Scheduling and supporting MDM HUB database weekly backups.
- Developed validation queries which were also used to develop reports as specified by the users.
- These reports were categorized into pre -transformation, cross- table reports (look up tables) and post-transformation reports: more like a before and after picture so that the users can know what to expect.
- Developed narratives that describe the technical process in business terms.
Environment: Informatica Power Center 7.1.3/ 8.1 Power Exchange, Informatica Multi Domain MDM 9.5.1, IDQ, Oracle 10g, MS Access, Excel, XML, Unix/Windows XP
Confidential
Software Developer
Responsibilities:
- Used Informatica workflow manager to execute the workflows and Informatica workflow monitor to monitor the status of the workflows.
- Developed standard and re-usable mappings and mapplets using various transformations like expression, aggregator, joiner, source qualifier, router, lookup, and filter.
- Configured the mappings to handle the updates to preserve the existing records using Update Strategy Transformation.
- Developed Test Cases for ETL/ Data warehouse changes based on technical specifications and Mapping documents
- Tested Complex ETL Mappings and Sessions based on business user requirements and business rules to load data from source flat files and RDBMS tables to target tables.
- Validated Informatica extraction, transformation and loading process by writing SQL against Business rules.
Environment: Informatica PowerCenter 8.1, Oracle 9i, SQL Server, Flat Files, TOAD, UNIX.
Confidential
Responsibilities:
- Analysed source systems, business requirements and identified business rules for building the data warehouse.
- Extensively used Informatica to load data from Oracle, XML and Flat Files to Oracle.
- Used Informatica workflow manager, monitor, and repository manager to execute and monitor workflows and assign user privileges.
- Set up Metadata driven utility design for the ETL processes using Informatica.
- Used debugger to test the mapping and fixed the bugs.
- Involved in tuning the performance of sessions and mappings.
- Used the Workflow manager to create workflows and tasks, and also created worklets.
- Involved in Production Support in resolving issues and bugs.
- Worked on SQL stored procedures, functions and packages in Oracle.
- Scheduled and executed batch and session jobs on Autosys.
- Created and maintained UNIX shell scripts for pre/post session operations and various day-to-day operations.
Environment: Informatica Power Center 6.0 Informatica Power Center 6.1.1/7.0, Oracle8i, PL/SQL, Business Objects 5.1, Windows NT, HPUX 10.20, Erwin 3.5.2, Autosys.
Confidential
Responsibilities:
- Created Informatica mappings to capture volume and data analysis to implement data quality across source and destination systems.
- Regular data analysis for data quality failures, perform analysis along with the business contacts to identify the root cause and partner with technical / business experts to remediate the issue
- Write queries to cross verify business rules validation and create workflows to send data issue alerts to respective team's BA.
- Daily Status call with client to report data issues and gathering requirement for new loops to implement data quality across all source and destination systems.
- Analysis of the specifications provided by the clients.
- Design, Data extraction, transformation and loading.
- Testing, validation and implementation of delivered components.
- Provide support for System testing and Integration testing.
- Responsible for overseeing the Quality procedures related to the project.
- Design and improve process to reduce the ETL window period. Tune ETL code & Queries for better performance.
Environment: Informatica Power Center 6.0, Oracle, MS SQL Server,Oracle8i.