Junior Data Scientist Resume

CA

TECHNICAL SKILLS

  • Analytic/Visualization Tools: Tableau, D3.js, Power BI
  • Languages: Python, R, Java, JavaScript, SAS
  • Big Data/Hadoop: Apache Kafka, MapReduce, Apache Spark, Apache Hive, Apache Spark MLlib
  • RDBMS/NoSQL: SQL, MySQL, PostgreSQL, SQLite, MongoDB, Cassandra
  • Statistics: Descriptive, Inferential (Estimation, Hypothesis Testing, t-tests, ANOVA, Correlation, Regression, χ²)
  • Databases: MS SQL Server, SQL (Oracle 10g), MySQL, Teradata
  • Cloud Platforms: Google Cloud Platform (Dataprep, SQL, BigQuery, ML Engine, Pub/Sub, Dataproc, Dataflow)

PROFESSIONAL EXPERIENCE

Confidential, CA

Junior Data Scientist

Responsibilities:

  • Built a machine learning model and made predictions via REST API calls for a SaaS product
  • Engineered a deep and wide learning model using TensorFlow on Google Cloud Platform, streaming over 1 TB of data, for the Dynamic Pricing module of the SaaS product; reduced the model error by 20% and training time by 30%
  • Devised a time series ARIMA model to implement the Demand Forecasting module for the SaaS product
  • Built an optimized ETL pipeline for both streaming and batch data from the existing data warehouse to BigQuery
  • Designed optimized queries to extract useful data and identify data patterns, informing the choice of algorithm to improve forecast accuracy
  • Engineered a multivariate LSTM time series model using Keras in Python and measured its accuracy using RMSE
  • Compared the model outputs with actual demand by developing dashboards in Google Data Studio; found that the LSTM model was more accurate

Confidential, CA

Junior Data Scientist

Responsibilities:

  • Performed large-scale processing and built an optimized ETL pipeline for marketing contacts data using Apache Spark; designed optimized HiveQL queries to understand data patterns and choose an algorithm to enrich the marketing contacts data
  • Devised a Locality Sensitive Hashing algorithm to eliminate the orphan node problem by clustering similar nodes
  • Provided senior management with visibility into the effectiveness of marketing campaigns by preparing executive dashboards in Tableau
  • Performed a gap analysis to reconcile reporting data between Cisco's legacy Oracle BI tool and its new SAP HANA-based BI initiatives

Confidential

Business Analyst

Responsibilities:

  • Designed a business process for analyzing and processing audit and financial data of housing societies; reported key findings to the manager
  • Interacted with clients to define KPIs; built Tableau dashboards for analyzing and summarizing fiscal data related to those KPIs
  • Identified issues and areas for improvement and proposed creative alternatives that improved business process efficiency by 30%
  • Developed a single-layer neural network model using pandas, NumPy, scikit-learn, and statsmodels, with back-propagation and gradient descent
  • Performed dataset cleaning and processing; used L2 regularization and dropout methods to overcome overfitting
