Hadoop Developer Resume
New York City, NY
SUMMARY:
- Around 8 years of experience in various IT sectors such as banking, health - care, and financial services, which includes hands-on experience in Big Data technologies.
- 3 years of experience as a Hadoop Developer in all phases of Hadoop and HDFS development.
- Hands on experience with HDFS, MapReduce and Hadoop Ecosystem (Pig, Hive, Oozie, Hbase, Zookeeper and Sqoop).
- Well versed with developing and implementing MapReduce jobs using Hadoop to work with Big Data.
- Have experience with Spark processing Framework such as Spark and Spark Sql.
- Experience in NoSQL databases like HBase, MongoDB
- Experience on Microservices, Spring boot and Spring cloud.
- Procedural knowledge in cleansing and analyzing data using HiveQL, Pig Latin, and custom MapReduce programs in Java.
- Experienced in writing custom UDFs and UDAFs for extending Hive and Pig core functionalities.
- Ability to develop Pig UDF'S to pre-process the data for analysis.
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS), Teradata and vice versa.
- Skilled in creating workflows using Oozie for cron jobs.
- Strong experience in Hadoop Administration and Linux.
- Experienced with Java API and REST to access HBase data.
- Worked extensively with Dimensional modeling, Data migration, Data cleansing, Data profiling, and ETL Processes features for data warehouses.
- Hands on experience in application development using Java, RDBMS, and Linux shell scripting.
- Hands on experience in PERL Scripting and Python.
- Experience on AWS EC2, EMR, LAMBDA and Cloud Watch
- Experience working with JAVA, J2EE, JDBC, ODBC, JSP, Java Eclipse.
- Extensive experience with SQL, PL/SQL and database concepts.
- Expertise in debugging and optimizing Oracle and java performance tuning with strong knowledge in Oracle 11g and SQL.
- Good experience working with Distributions such as MAPR, Horton works and Cloudera.
- Experience in all stages of SDLC (Agile, Waterfall), writing Technical Design document, Development, Testing and Implementation of Enterprise level Data mart and Data warehouses.
- Having good knowledge on Hadoop Administration like Cluster configuration, Single Node Configuration, Multi Node Configuration, Data Node Commissioning and Decommissioning, Name Node Backup and Recovery, HBase, HDFS and Hive Configuration, Monitoring clusters, Access control List.
- Good Inter personnel skills and ability to work as part of a team. Exceptional ability to learn, master new technologies and to deliver outputs in short deadlines.
TECHNICAL SKILLS:
Technology: Hadoop Ecosystem /Spring Boot/Microservices/AWS /J2SE/J2EE/Oracle
Operating Systems: WindowsVista/XP/NT/2000Series, UNIX/LINUX (Ubuntu, CentOS, Redhat).
DBMS/Databases: DB2, My SQL, SQL, PL/SQL
Programming Languages: C, C++, JSE, XML, Spring, HTML, JavaScript, jQuery, Web services.
Big Data Ecosystem: HDFS, Map Reduce, Oozie, Hive/Impala, Pig, Sqoop, Zookeeper and HbaseSpark, Scala
Methodologies: Agile, Water Fall
NOSQL Databases: Mongo DB, Hbase
Version Control Tools: SVN, CVS, VSS, PVCS
WORK EXPERIENCE:
Confidential, New York City, NY
Hadoop Developer
Responsibilities:
- Involved in loading and transforming large sets of structured, semi structured and unstructured data from relational databases into HDFS using Sqoop imports.
- Developed Sqoop scripts to import export data from relational sources and handled incremental loading on the customer, transaction data by date.
- Migrated existing java application into microservices using spring boot and spring cloud.
- Working knowledge in different IDEs like Eclipse, Spring Tool Suite.
- Working knowledge of using GIT, ANT/Maven for project dependency / build / deployment.
- Developed simple and complex MapReduce programs in Java for Data Analysis on different data formats.
- Developed Spark code using Scala and Spark-SQL/Streaming for faster testing and processing of data.
- Worked as a part of AWS build team.
- Create, configure and managing S3 bucket(storage).
- Experience on AWS EC2, EMR, LAMBDA and Cloud Watch.
- Import the data from different sources like HDFS/Hbase into Spark RDD.
- Experienced with batch processing of data sources using Apache Spark and Elastic search.
- Experienced in implementing Spark RDD transformations, actions to implement business analysis
- Migrated Hive QL queries on structured into Spark QL to improve performance
- Optimized MapReduce Jobs to use HDFS efficiently by using various compression mechanisms.
- Worked on partitioning HIVE tables and running the scripts in parallel to reduce run-time of the scripts.
- Worked on Data Serialization formats for converting Complex objects into sequence bits by using AVRO, PARQUET, JSON, CSV formats.
- Responsible for analyzing and cleansing raw data by performing Hive/Impala queries and running Pig scripts on data.
- Administration, installing, upgrading and managing distributions of Hadoop, Hive, Hbase.
- Involved in performance of troubleshooting and tuning Hadoop clusters.
- Created Hive tables, loaded data and wrote Hive queries that run within the map.
- Implemented business logic by writing Hive UDFs in Java.
- Developed Shell scripts and some of Perl scripts based on the user requirement.
- Wrote XML scripts to build OOZIE functionality.
- Used OOZIE Operational Services for batch processing and scheduling workflows dynamically.
- Extensively worked on creating End-End data pipeline orchestration using Oozie.
- Built Data set, Lens and visualization charts/graphs in the PLATFORA environment.
- Evaluated suitability of Hadoop and its ecosystem to the above project and implementing / validating with various proof of concept (POC) applications to eventually adopt them to benefit from the Big Data Hadoop initiative.
Environment: Map Reduce, HDFS, Spring Boot, Microservices, AWS, Hive, Pig, SQL, Sqoop, Oozie, Shell scripting, Cron Jobs, Apache Kafka, J2EE.
Confidential, BostonHadoop Developer
Responsibilities:
- Responsible for Installation and configuration of Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Developed Sqoop scripts to import export data from relational sources and handled incremental loading on the customer, transaction data by date.
- Build microservices for the delivery of software products across the enterprise.
- Develop strategy for integrating internal security model into new projects with Spring Security and Spring Boot.
- Developed simple and complex MapReduce programs in Java for Data Analysis on different data formats.
- Involved in moving all log files generated from various sources to HDFS for further processing through Flume.
- Developed Spark code using Scala and Spark-SQL/Streaming for faster testing and processing of data.
- Import the data from different sources like HDFS/Hbase into Spark RDD.
- Experienced with batch processing of data sources using Apache Spark and Elastic search.
- Experienced in implementing Spark RDD transformations, actions to implement business analysis
- Responsible for analyzing and cleansing raw data by performing Hive queries and running Pig scripts on data.
- Installing, Upgrading and Managing Hadoop Clusters
- Used OOZIE Operational Services for batch processing and scheduling workflows dynamically.
- Administration, installing, upgrading and managing distributions of Hadoop, Hive, Hbase.
- Advanced knowledge in performance troubleshooting and tuning Hadoop clusters.
- Migrated Hive QL queries on structured into Spark QL to improve performance
- Optimized MapReduce Jobs to use HDFS efficiently by using various compression mechanisms.
- Worked on partitioning HIVE tables and running the scripts in parallel to reduce run-time of the scripts.
- Worked on Data Serialization formats for converting Complex objects into sequence bits by using AVRO, PARQUET, JSON, CSV formats.
- Extensively worked on creating End-End data pipeline orchestration using Oozie.
- Experience on Amazon Web Services(AWS), Amazon Cloud Services like Elastic Compute Cloud(EC2), Simple Storage Service(S3), Elastic Map Reduce(EMR) Amazon Simple DB, Amazon Cloud Watch, SNS, SQS, LAMBDA.
- Evaluated suitability of Hadoop and its ecosystem to the above project and implementing / validating with various proof of concept (POC) applications to eventually adopt them to benefit from the Big Data Hadoop initiative.
Environment: Spring Boot, Microservices, AWS, Map Reduce, HDFS, Hive, Pig, SQL, Sqoop, Oozie, Shell scripting, Cron Jobs, Perl scripting, Apache Kafka, J2EE.
Confidential, South CarolinaHadoop Developer
Responsibilities:
- Responsible for Installation and configuration of Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Involved in design Cassandra data model, used CQL (Cassandra Query Language) to perform CRUD operations on Cassandra file system
- Involved in moving all log files generated from various sources to HDFS for further processing through Flume.
- Involved in loading and transforming large sets of structured, semi structured and unstructured data from relational databases into HDFS using Sqoop imports.
- Developed Sqoop scripts to import export data from relational sources and handled incremental loading on the customer, transaction data by date.
- Developed simple and complex MapReduce programs in Java for Data Analysis on different data formats.
- Developed Spark scripts by using Scala Shell commands as per the requirement.
- Developed and implemented core API services using Scala and Spark.
- Optimized MapReduce Jobs to use HDFS efficiently by using various compression mechanisms.
- Worked on partitioning HIVE tables and running the scripts in parallel to reduce run-time of the scripts.
- Worked on Data Serialization formats for converting Complex objects into sequence bits by using AVRO, PARQUET, JSON, CSV formats.
- Responsible for analyzing and cleansing raw data by performing Hive queries and running Pig scripts on data.
- Installing, Upgrading and Managing Hadoop Clusters
- Administration, installing, upgrading and managing distributions of Hadoop, Hive, Hbase.
- Advanced knowledge in performance troubleshooting and tuning Hadoop clusters.
- Created Hive tables, loaded data and wrote Hive queries that run within the map.
- Implemented business logic by writing Pig UDF's in Java and used various UDFs from Piggybanks and other sources.
- Used OOZIE Operational Services for batch processing and scheduling workflows dynamically.
- Extensively worked on creating End-End data pipeline orchestration using Oozie.
- Populated HDFS and Cassandra with huge amounts of data using Apache Kafka.
- Processed the source data to structured data and store in NoSQL database Cassandra.
- Created alter, insert and delete queries involving lists, sets and maps in Cassandra.
- Design and develop JAVA API (Commerce API) which provides functionality to connect to the Cassandra through Java services.
- Responsible for continuous monitoring and managing Elastic MapReduce cluster through AWS console.
- Evaluated suitability of Hadoop and its ecosystem to the above project and implementing / validating with various proof of concept (POC) applications to eventually adopt them to benefit from the Big Data Hadoop initiative.
Environment: Map Reduce, HDFS, Hive, Pig, HBase, SQL, Sqoop, Flume, Oozie, Apache Kafka, Zookeeper, J2EE, Eclipse, Cassandra.
ConfidentialJava / J2EE Developer
Responsibilities:
- Developed the application using Struts Framework that leverages classical Model View Layer (MVC) Architecture UML diagrams like use cases, class diagrams, interaction diagrams (sequence and collaboration) and activity diagrams were used
- Gathered business requirements and wrote functional specifications and detailed design documents
- Extensively used Core Java, Servlets, JSP and XML
- Designed the logical and physical data model, generated DDL scripts, and wrote DML scripts for Oracle 9i database
- Implemented Enterprise Logging service using JMS and apache CXF.
- Developed Unit Test Cases, and used JUNIT for unit testing of the application
- Implemented Framework Component to consume ELS service.
- Implemented JMS producer and Consumer using Mule ESB.
- Wrote SQL queries, stored procedures, and triggers to perform back-end database operations
- Designed Low Level design documents for ELS Service.
- Developed SQL stored procedures and prepared statements for updating and accessing data from database.
- Development carried out under Eclipse Integrated Development Environment (IDE).
- Used JBoss for deploying various components of application.
- Involved in Unit testing, Integration testing and User Acceptance testing.
- Utilizes Java and SQL day to day to debug and fix issues with client processes.
Environment: Java, spring core, JBoss, JUNIT, JMS, JDK, SVN, Maven, Servlets, JSP and XML
ConfidentialJava / J2EE Developer
Responsibilities:
- Used AGILE methodology for developing the application.
- As part of the lifecycle development prepared class model, sequence model and flow diagrams by analyzing Use cases using Rational Tools.
- Extensive use of SOA Framework for Controller components and view components.
- Involved in writing the exception and validation classes using Struts validation rules.
- Involved in writing the validation rules classes for general server side validations for implementing validation rules as part observer J2EE design pattern.
- Used OR mapping tool Hibernate for the interaction with database. Involved in writing Hibernate queries and Hibernate specific configuration and mapping files.
- Developed EJB tier using Session Facade, Singleton and DAO design patterns, which contains business logic, and database access functions.
- Involved in developing JSP pages and custom tag for presentation layer in Spring framework. Developed tabs with rollover effect using JavaScript plus other effects.
- Developed web services using SOAP and WSDL with Apache Axis 2.
- Developed, implemented, and maintained an asynchronous, AJAX based rich client for improved customer experience using XML data and XSLT templates.
- Developed SQL stored procedures and prepared statements for updating and accessing data from database.
- Development carried out under Eclipse Integrated Development Environment (IDE).
- Used JBoss for deploying various components of application.
- Used JUNIT for testing and check API performance. Involved in fixing bugs and minor enhancements for the front-end modules. Responsible for troubleshooting issues, monitoring and guiding team members to deploy and support the product.
- Used SVN Version Control for Project Configuration Management.
- Worked with the Android SDK, and implemented Android Bluetooth and Location Connectivity components.
- Deploying applications in AppServers for DEVL, ALPHA and Beta integration environments.
- Worked with business and System Analyst to complete the development in time.
- Implemented the presentation layer with HTML, CSS and JavaScript.
- Developed web components using JSP, Servlets and JDBC
- Implemented secured cookies using Servlets.
- Wrote complex SQL queries and stored procedures.
- Implemented Persistent layer using Hibernate API
- Implemented Transaction and session handling using Hibernate Utils.
- Implemented Search queries using Hibernate Criteria interface.
- Provided support for loans reports for CB&T
- Designed and developed Loans reports for Evans bank using Jasper and iReport.
- Involved in fixing bugs and unit testing with test cases using Junit
- Resolved issues on outages for Loans reports.
- Maintained Jasper server on client server and resolved issues.
- Actively involved in system testing.
- Fine tuning SQL queries for maximum efficiency to improve the performance
- Designed Tables and indexes by following normalizations.
- Involved in Unit testing, Integration testing and User Acceptance testing.
- Utilizes Java and SQL day to day to debug and fix issues with client processes.
Environment: Java, Servlets, JSP, Hibernate, Junit Testing, Oracle DB, SQL, Jasper Reports, iReport, Maven, Jenkins.
 Privacy Policy
Privacy Policy
                                