We provide IT Staff Augmentation Services!

Sr. Big Data/machine Learning Consultant Architect/engineer Resume

3.00/5 (Submit Your Rating)

SUMMARY:

  • Senior Big Data/Machine Learning Engineer/Architect/Technical Lead/Data Scientist with over 15 year experience in client server, multi - tier and distributed and cloud architectures,
  • Experience in Cloud, Big Data, DevOps, Analytics, Business Intelligence, Data mining, Machine learning, Algorithm development, Distributed computing, Programming and Scripting languages,
  • Experience in all aspects of SOA application development (design, coding, testing, deployment and maintenance),
  • Proficiency in various libraries, frameworks and tools,
  • Strong analytical and problem-solving skills,
  • Good writing and communication skills, ability to work both independently and within a team.

TECHNICAL:

Big Data : Spark, AWS, EMR, S3, Kinesis, Lambdas, State-machines, Step-functions, SageMaker, IAM, DynamoDB, Hive, Genie, SQS, SNS, EC2, Spark Sql, Kafka, Scoop, Yarn, HDP, Machine Learning, AI, Cassandra, Druid, Redshift, Hadoop, Azure

DevOps : CI/CD, Terraform, Serverless framework, Shell scripting

Programming : Java/J2EE, Python, Scala, Shell scripting, .NET, C#, JUnit, Json, Parquet, Xml,, Maven, Ant, Jira, Jenkins, Gitlab, Github, Svn, Design patterns, Agile, Security, SSL, LDAP, Swift, Intellij, MS Visual Studio, Eclipse, XCode, Docker, NodeJs, JavaScript, Ajax, Spring, Hibernate, SOA, Rest, Soap, Servlets, Jsp, Jdbc, Jms, Jpa, Mvc, Tomcat, JBoss, Css, Html, AngularJS, Bootstrap, D3, JQuery, C++, STL, Asp

Databases : Hive, Oracle, MySql, Postgres, DynamoDB, Cassandra, Druid, Redshift, HBase, MS SqlServer, Aurora, PI Server (OSISoft)

Tools : AWS Athena, SageMaker, Jupyter, Splunk, Informatica, Tableau, Oracle OBIEE, Matlab, R, Weka, SPSS, Visio, Tibco Spotfire, Sql Developer, Cloudwatch, Airflow, Genie

OS : Linux, Mac, Windows, iOS.

EMPLOYMENT SUMMARY:

Sr. Big Data/Machine Learning Consultant Architect/Engineer

Confidential

Responsibilities:

  • Architect and developed cloud-based real-time , big-data ingestion and analytics components using AWS stack, Spark, EMR, PI Server, Cloudwatch triggers, State machines, Lambda functions,
  • Developed and deployed to production multiple projects in the CI/CD pipeline for real-time data distribution, storage and analytics. Persistence to S3, HDFS, Postgres,
  • CI/CD automation and orchestration of deployment to various environments using Gitlab, CI/CD Pipelines,
  • Real-time data ingestion service creation and enhancement from PI Server (OSISoft) into AWS cloud,
  • Optimization and trouble shooting, test case integration into CI/CD pipeline using docker images,
  • Hiring, mentoring, interfacing with business, end-to-end pipeline ownership, technology evaluation.

Environment: Java/J2EE, Spark, AWS, EMR, S3, Hive, Kinesis, Lambdas, State-machines, Step-functions, Python, EMR, CI/CD, Hadoop, Spark Sql, Athena, IAM, VPC, Json, Parquet, PI Server (OSISoft), DynamoDB, Aurora, Cloudwatch, Intellij, Maven, Scrum, Confluence, Gitlab, Scala, Serverless, Terraform, Shell scripting, C#, Genie, IOT, Machine Learning, AI, SageMaker, Jupyter, Glue, Postgres, Tableau.

Big Data Consultant Architect / Senior Engineer / Data Scientist

Confidential

Responsibilities:

  • Architect, develop cloud-based real-time , big-data ingestion and analytics platform using AWS stack, Spark and EMR for IOT device data,
  • Developed and deployed to production multiple projects in the pipeline for real-time data distribution, storage and analytics. Persistence to S3, HDFS,
  • Designed and developed metric reporting using Cloudwatch and Splunk,
  • Data quality, data modeling, and predictive analytics for profiling device performance and forecasting device failures,
  • Automation and orchestration of deployment to various environments using Jenkins, Airflow and Genie.

Environment: Java/J2EE, Spark, AWS, S3, Hive, Kinesis, Python, PySpark, Cloudwatch, EMR, Predix, Hadoop, Spark Sql, Json, Parquet, DynamoDB, Aurora, Lambda, Intellij, Maven, Scrum, Confluence, Jenkins, Github, Scala, Shell scripting, Groovy, Docker, Splunk, Airflow, Genie, IOT, Machine Learning.

Big Data Consultant Architect / Senior Engineer / Data Scientist

Confidential

Responsibilities:

  • Spark based engine for data processing and validations, rules, persisting, emailing, indexing, search, Salesforce integration, EMR, SQS,
  • Architecting and framework extension with Spark Sql rule-based real time aggregations engine,
  • Integrating data from various sources, persistence based on configuration-driven rules and mappings, flyway migrations,
  • Working with DevOps and QA actively,
  • Benchmarking and performance evaluation,
  • Technical problem solving and guidance, scrum, confluence.
  • Big data ingestion (billions of rows) into Cassandra, HDFS, Druid,
  • Data modeling and data analysis using demographic and activity datasets (Experian, Rentrak, Simmons, DSP)
  • NoSQL and join queries on big data, dimensions, metrics, cross-walks,
  • Audience creation, insights, analysis of ad campaign performance data,
  • Programmatic .

Environment: Java/J2EE, Spark, Cassandra, Druid, Hive, Redshift, MySql, Hadoop, HDFS, AWS, SQS, S3, EMR, Spark Sql, Python, PySpark, Datastax, Imply, Sparkline Data, Spring, Maven, Solr, Salesforce, Linux, Mac, Scrum, Jira, Confluence, Jenkins, Github, SourceTree, Scala, Docker, Flyway, Datorama (data warehouse), Salesforce, Puppet, Splunk.

Consultant Architect/Principal Software Engineer/Data Scientist

Confidential

Responsibilities:

  • Architect, requirements capture, design, develop, test and deploy analytics portal platform,
  • Built tools and functionality in Java/J2EE for business workflows and processes,
  • Built in live analytics capability for monthly, quarterly, annual monitoring via graphs and charts,
  • UI design and process integration to capture data electronically,
  • User-centric metrics, reporting,
  • User mailing, logging, authentication (via LDAP), feedback, announcements, search,
  • ETL (Service and UI for loading data from data feeds, star schema, data warehouse),
  • Sql procedures and functions for business logic,
  • Documentation, design, components and flow.

Environment : Java/J2EE, Oracle OBIEE, Tableau, Informatica, Spotfire, Spring, Hibernate, SOA, Rest, Oracle, MS SqlServer, Mvc, Servlets, Jsp, Jdbc, Dojo, JavaScript, Ajax, Html, Sql, Tomcat, Weblogic, LDAP, Xml, Json, Maven, Svn, Firebug, Css, AngularJS, Bootstrap, Linux, Windows.

Confidential

Senior Consultant

Responsibilities:

  • UI, middle-tier and backend programming for purchase order, invoices and checks processing,
  • Project and vender based expenditure and balances report generation,
  • Scheduling, Integrating data fetching from Oracle E-Business Suit (pl/sql procedures),
  • Data loading, transformation, cleaning, aggregating data,
  • File attachment uploading and linking, filtering and lookup,
  • Deploying on staging and production environments ,
  • Requirements gathering, end-user interaction, presentation and,
  • Bug fixing, debugging and testing.
  • SQL Server Profiling and Performance Tuning and Optimization of Response times.

Environment : Java/J2EE, Oracle OBIEE, Tableau, Informatica, Spring, Hibernate, SOA, Rest, Oracle, MS SqlServer, Mvc, Servlets, Jsp, Jdbc, Dojo, JavaScript, Ajax, Html, Sql, Tomcat, Weblogic, LDAP, Xml, Json, Maven, Svn, Firebug, Css, AngularJS, Bootstrap, Linux, Windows, C#, IIS, Crystal Reports, Visio

Senior Consultant / Senior Software Engineer / Data Scientist

Confidential

Responsibilities:

  • Building e-commerce product to product recommendation models (Java and R) and services,
  • Model evaluation and testing strategies and metrics, AB-Testing.
  • Service development - multi-threaded configuration and catalog refresh, delivery of product recommendations to front end components, multi-threaded session, events logging via in-memory blocking queue for tracking and reporting, configuration-driven application development, real-time model learning and decisioning, traffic-routing to fetch recommendations, quartz scheduling of jobs, log files monitoring and email sending,
  • Oracle real-time decisions - informants, advisors, decisions, models, entities, choices, filtering and statistics.

Environment : Java/J2EE, Oracle RTD, Decision Studio, R, Spring, SOA, Web Services, AOP, Jms, Xml, Json, Maven, Svn, Firebug, Log4J, Sql Developer, OC4J, WebSphere, Pl/Sql, Hadoop, Map-Reduce, Linux, Windows, Hudson, Nexus, Splunk, Scrum.

We'd love your feedback!