Scala / Spark Developer Consultant Resume
SUMMARY:
- Diversified Software Development experience in getting the Software Pieces to work together.
- Hands on experience in Big Data / Hadoop (MapR, Hortonworks, Cloudera, AWS), Map Reduce, Pig, Hive, Oozie, Kafka, Flume, Sqoop
- Experience in Apache Spark Programming
- Experience in Scala - Functional Programming
- NoSQL Databases - HBase, Cassandra
- C++, Object Oriented Programming, STL
- Extensive experience in UNIX programming, UNIX Shell Scripting, UNIX Batch Processing
- Experience in Software Methodologies like Waterfall, Agile XP/SCRUM
- Experience of Oracle 10g, 11i, DB2, SQL Server 2005/2008, MySQL
- Experience in J2EE Design Patterns, Model-View Controller design paradigm and Struts Framework
- Data Structures & Algorithms
- UNIX TCP/IP IPC (Socket) Programming in C, C++ Multithread, GDB (debugger) using in C/C++
- SQL, PL/SQL, Stored Procedures
- UNIX Shell Scripting, Perl, Python
- Production Support, Maintenance, Enhancement, Debug activities
- Version Control Tools: GIT, Clear Case, SVN, CVS, Microsoft Visual SourceSafe
- Tools: Eclipse 3.0, Toad 7.4, Ant 1.5, Maven, Log4j, Jenkins, JUnit
TECHNICAL SKILLS:
Big Data Technologies: HADOOP/SPARK, Scala, Akka, MapR, Hortonworks, HDFS, Pig, Hive, Java Map Reduce, HBase, Sqoop, Flume, Apache Kafka, Oozie
Internet Technologies: Java (JDBC 2.0), J2EE (Servlets 2.1, JSP 1.2), XML
App Servers/ Middleware: Tomcat Application Server
IDE / Programming Tools: Eclipse 3.0, Toad 7.4, Maven, Log4j, JUnit
Operating System: UNIX, Linux, HP-UX, AIX, CentOS, Sun Solaris, Ubuntu, Windows 2000/NT/XP
Database/Tools: JDBC, Oracle10g / 9i, XML
Version Control: GIT, Clear Case, SVN, CVS
PROFESSIONAL EXPERIENCE:
Confidential
Scala / Spark Developer Consultant
Responsibilities:
- Developed modules for the aggregation of data sets in Apache Spark using Scala to process large data sets like Chat logs, Fibernode data sets, and Property Labs data sets.
- Involved in developing legacy Java MapReduce code to process the data sets in the existing data pipelines.
- Involved in writing scripts using PIG, HIVE and involved in creating workflows using Oozie.
- Involved in big data ingestion and streaming tools like Apache Flume, Sqoop.
- Processed data by loading from NoSQL Database like Cassandra into Spark application for data processing and dumping the final spark application output into Cassandra data store.
- Involved in peer-to-peer code reviews.
- Involved in maintaining the code modules using GIT.
Environment: Apache Spark 2.0, Scala, Akka, Hadoop (Hortonworks), Spark Streaming, SparkSQL, AWS, HDFS, EMR, Hive, MapReduce, ORC, JSON, Parque, Sqoop, Kafka, Ambari, Tableau, Shell Scripting, Linux.
Confidential, MN
Big Data Software Consultant
Responsibilities:
- Developed extensive Apache Spark DataFrame / SparkSQL appliactions using Apache Spark and Scala.
- Involved in developing modules using PIG, HIVE, Spark, Scala, Sqoop.
- Developed, built and maintained code modules using GIT and Jenkins.
- Wrote extensive Java UDF functions using Distributed Cache functionality of PIG to distribute ICD10 frequency codes to the map tasks running on data nodes.
- Involved in developing the PIG script by making use of Java UDF’s in the form of Jar files to develop the data sets for ICD10 based on claim headers.
- Developed Java MapReduce code and PIG scripts for Hadoop Hortonworks and MapR platforms.
- Developed scripts in HIVE and stored the data in HIVE not only in text format, but also in binary format using ORC format.
Environment: Apache Spark, Scala, Hadoop (MapR, Hortonworks), Spark Streaming, EMR, HDFS, Pig, Hive, MapReduce, AWS, Kafka, Sqoop, HBase, Ambari, Tableau, MySQL, UNIX Shell Scripting, Linux.
Confidential, CA
Software Engineer
Responsibilities:
- Developed multiple MapReduce jobs in java for data cleaning and preprocessing.
- Developed Simple to complex MapReduce Jobs using Hive and Pig.
- Analyzed large data sets by running Hive queries and Pig scripts.
- Involved in creating Hive tables, and loading and analyzing data using hive queries.
- Involved in loading data from LINUX file system to HDFS.
- Load and transform large sets of structured, semi structured and unstructured data.
- Developed majority of C/C++ modules which will serve complete functionality of the predicted subscriber totals application.
- Assisted in exporting analyzed data to relational databases using Sqoop.
- Followed Agile methodology.
- Involved in code reviews.
Environment: Hadoop, HDFS, Scala, Pig, Hive, Java MapReduce, Sqoop, HBase, Python, CentOS/Ubuntu/LINUX, C /C++, Java, SQL, PL/SQL, Binary Search Trees, Binary Search, Inheritance, Polymorphism, Oracle, MySQL, UNIX Shell Scripting, STL, GDB, UNIX Scripting, Linux.
Confidential, NJ
Senior Software Engineer
Responsibilities:
- Experience working on processing unstructured data using Pig and Hive.
- Collecting and aggregating large amounts of log data using Apache Flume into HDFS for further analysis.
- Worked on debugging, performance tuning of Hive & Pig Jobs.
- Involved in loading data from LINUX file system to HDFS.
- Importing and exporting data into HDFS and Hive using Sqoop.
- Supported MapReduce Programs those are running on the cluster.
- Worked as C/C++ developer and involved in fixing the production code defects in various C/C++ applications.
- Prepared low level detail design document from Functional spec document.
- Extensively used GDB debugger to debug the C code to resolve the coredump issues.
- Involved in fixing build issues in various C/C++ applications.
- Designed and developed C/C++ Boost Multithreaded Consumer Enhancement Appendage Server using Boost Threads, Boost Property Maps, Boost Dynamic Bitsets, Program Options to pull the consumer appendages data records up to 60 million records from USPS and various mailing data sources.
Environment: Hadoop, HDFS, Pig, Hive, MapReduce, Sqoop, CentOS/LINUX, Big Data, C/C++, Python, Inheritance, Polymorphism, Design Patterns, Data Structures & Algorithms (Binary Trees, Binary Search Trees, AVL Trees), Boost, Boost Program Options, Boost Thread Libraries, Boost Chrono, Boost Dynamic Bitset, Boost Property Tree, Oracle, MySQL, UNIX Shell Scripting, STL, SVN, GitHub, GDB, Linux.
Confidential, FL
Senior Software Consultant
Responsibilities:
- Worked as C/C++/UNIX Senior Software Developer in E*Seamless Availability (Airline Ticket Reservation Industry).
- Designed and developed store and search modules in C to reformat reservation Bid Price data sent by EDW (Enterprise Data Warehouse). The EDW driver program creates data files with the data sent by EDW.
- Developed Bid Price data cleanup modules in C using binary search trees data structures to clean up the Bid Price data for various flights for the next available year dates.
- Performed the troubleshooting, maintenance activities to resolve the defects in E*Seamless Availability applications.
Environment: C/C++, Pro*C, STL, Linux, Oracle, Golden, Web Services, UNIX Shell Scripting, gdb, Binary Search Trees data structures, UNIX Socket API, Sed/AWK, Perl, Oracle 9i, SVN.
Confidential
Software Consultant
Responsibilities:
- Development of service layer using Spring Framework, Hibernate, Java/J2EE technologies.
- Modeled application using use case diagrams, class diagrams and sequence diagrams.
- Written DAO Interfaces using Spring MVC framework that uses ORM model Hibernate to connect to Oracle Database to store and retrieved data.
- Involved in Batch Processing of Car Fleet Management data files using Shell Scripts on UNIX platform thru Autosys.
Environment: Java/J2EE, Spring Framework, Hibernate, Servlets, CSS, Rational Rose, Ant, TOAD, Rational Clear Quest, Oracle 10g, XSV, JUNIT, AIX, Sun Solaris 2.6, and Windows XP, Apache Tomcat, Oracle Developer Studio, UNIX, Shell Scripts, XML, Windows 7, Microsoft Office Suite.
Confidential
Software Consultant
Responsibilities:
- Developed Multithreaded applications in C/C++/Pro*C for Bill Payment, Electronic Billing, Payment Batch Processing domains on Linux/Solaris platforms.
- Involved in developing Confidential Credit / Debit card payment processing modules using Java Strut Framework, JSP, JDBC, Servlets, Tomcat, Log4J, and DB2.
- Written Ant Build Scripts to build the EAR and WAR.
- Written Database Stored Procedures in Oracle PL/SQL.
- Wrote Batch Processing UNIX Shell Scripts for Credit/Debit card payment data files on UNIX platform and launched them thru cron jobs.
- Performed the troubleshooting, maintenance activities to resolve the defects in Bill Payment, Electronic Billing applications.
Environment: Java/J2EE, C/C++, Pro*C, Struts Framework, JDBC, Servlets, Linux, POSIX Multithreading, UNIX Sockets, Makefiles, Rational Rose, Ant, TOAD, Rational Clear Quest, Oracle 10g, JUNIT, AIX, Sun Solaris 2.6, Tomcat application server, Oracle Developer Studio, UNIX, Shell Scripts, XML, Windows 7, Microsoft Office Suite, UNIX Shell Scripting, Sed/AWK, SVN, GDB.
Confidential, OK
Software Consultant
Responsibilities:
- Call processing modules developed in C/C++ fetch Flat Rate details of Telephone Service from Sybase database and creates hash maps in main memory.
- Created JSP pages that contains Strut Framework, Servlets, JSP, Tomcat, and Log4J.
- Developed Struts Action Forms and Action classes for validation and business logic respectively.
- Written Action Forms that represents business profiles that is set or get by the HTML form.
- Written Action Classes that performs actions for doing Action Forwards.
- Wrote PL/SQL procedures to pull the data from backend Oracle database to generate Rating/Billing Reports.
Environment: Java/J2EE, Rational Rose, Ant, TOAD, Clear Quest, Oracle 10g, JUNIT, AIX, Sun Solaris 2.6, Oracle Developer Studio, UNIX Shell Scripts, C++, Xerces XML C++ Parser API, POSIX IPC Pipes, UNIX Sockets, POSIX Multithreading, AVL Binary Trees, Binary Search, Inheritance, Polymorphism, XML, Windows 7, Microsoft Office Suite.
Confidential, NY
Software Consultant
Responsibilities:
- Worked as deployment engineer to deploy jar files of the various Java projects on Apache Tomcat servers and provided production support for online banking applications on Linux environment.
- Designed and developed Web forms for Investment Banking Projects in Perl/CGI, Java Script for capturing Requests data.
Environment: Java, J2EE, Web forms Development, Front page, Java Script, HTML, XML, Oracle J2EE, Java, WebSphere, Apache Tomcat, Korn UNIX Shell Scripting, Sed, AWK, Cron Job, CVS.
Confidential
Programmer Analyst
Responsibilities:
- Part of the team who developed BerkeleyDB Engine in C++/STL/BerkeleyDB C++ API/Xerces XML Parser API, to restore the functionality of Workflow Management System.
- Designed and Developed Worker component for Universal Billing Complex project in ‘C’ using UNIX IPC Sockets, SNAC Protocol to capture and rate Volume Based Billing records generated when the users logs out from AOL application.
- Developed Exception Log Processor in C/Linux to process exception log files generated by Cycle Activity Billing system.
- Performed the troubleshooting, maintenance activities to resolve the defects in Billing and Rating applications.
- Part of the team which involved in the design and development of iSCSI Multithreaded pseudo device driver in C on Linux platform using POSIX Threads, Sockets, to access SCSI devices over the network.
- Designed and developed GTDB Access API for DHL client in Java using Servlets, and JDBC.
- Wrote UNIX Shell Scripts on HP-UX platform to do the numerous code changes in GTDB Java source code files.
- Wrote UNIX shell script to test around 500 GTDB Java APIs automatically by supplying XML input files thru shell scripts.
Environment: C/C++, STL, Xerces XML C++ Parser API, Linux, IPC Sockets, Shell Scripting, Perl, C/C++ Multithreading, Berkeley DB C++ APIs, Rational Rose, UML, iSCSI, J2EE, JSP, JDBC, XML, Ant, Sybase, Stored Procedures, Informix, CVS, VSS.
Confidential
Software Engineer
Responsibilities:
- ERMS (Executive Resource Management System) is developed on Linux in Java using Strut Framework, Servlets, JSP, Tomcat, Log4J, and DB2.
- Developed API wrappers using DES APIs in Visual C++ as a part of healthcare related HIPAA Security Initiative to handle encryption and decryption.
- Wrote extensive Shell Scripts for the installation of ERMS modules on the Linux environment.
Environment: C, VC++, Linux, J2EE, JSP, JDBC, Struts Framework, Tomcat, DB2, DES, Internet Security, TSL/SSL, Encryption, Decryption, UNIX Shell Scripting, Sed, AWK, Stored Procedures, CVS, VSS, Windows NT/2000.
Confidential
Software Engineer
Responsibilities:
- Part of the team who developed a Queuing Engine in C on UNIX platform using UNIX IPC mechanisms (Sockets) which provides a temporary buffering mechanism in EventSolv / EventProg mobile messaging middleware framework.
- Involved in implementing Encryption features for the EventProg messages using Linux C/SSL API of Baltimore C/SSL Toolkit.
- Wrote extensive UNIX Shell Scripts using Sed/Awk for the installation of EventProg modules on several Linux boxes.
Environment: C, Linux, Shell Scripting, IPC Socket Programming, IPC Pipes, POSIX Multithreading, Sed, Internet Security, C/SSL, TSL, CVS.