Junior Data Scientist Resume
4.00/5 (Submit Your Rating)
CA
TECHNICAL SKILLS
- Tableau, D3.js, Power BI Python, R, Java, JavaScript, SAS
- Apache Kafka, MapReduce, Apache Spark, Apache Hive
- Apache Spark MLlib SQL, MySQL, PostgreSQL, SQLite, MongoDB, Cassandra
- Descriptive, Inferential (Estimation, Hypothesis Testing, t - tests, ANOVA, Correlation, Regression, X A 2)
- MS SQL Server, SQL (Oracle 10g), MySQL, Teradata
- Google Cloud Platform (Dataprep, SQL, BigQuery, ML Engine, Pub/Sub, Dataproc, Dataflow)
- Analytic/ Visualization Tools
- Languages
- Big Data/ Hadoop
- RDBMS/ NoSQL
- Statistics
- Databases
- Cloud Platforms
PROFESSIONAL EXPERIENCE
Confidential, CA
Junior Data Scientist
Responsibilities:
- Built machine learning model and made predictions using REST API calls for a SAAS product
- Engineered deep and wide learning model using Tensorflow on Google Cloud Platform streaming over 1 TB of data for Dynamic Pricing module of teh SAAS product; reduced teh model error by 20% and training time by 30%
- Devised Time Series ARIMA model to implement Demand Forecasting module for teh SAAS product
- Fabricated an optimized ETL pipeline on both streaming and batch mode data from existing data warehouse to BigQuery
- Designed optimized queries to extract useful data to identify data patterns for deciding an algorithm to improve teh forecast accuracy
- Engineered Multivariate LSTM time series model using Keras in Python and measured teh accuracy of this model using RMSE
- Compared teh outputs with teh actual demand by developing dashboards using Google Data Studio; found dat LSTM is more accurate
Confidential, CA
Junior Data ScientistResponsibilities:
- Performed large scale processing, built an optimized ETL pipeline for marketing contacts data using Apache Spark; designed optimized queries using HiveQL to understand teh data patterns for deciding an algorithm to enrich marketing contacts data
- Devised Locality Sensitive Hashing algorithm to eliminate teh problem of orphan node by clustering teh similar nodes
- Provided visibility in teh TEMPeffectiveness of marketing campaigns to senior management by preparing executive dashboards in Tableau
- Performed a gap analysis to reconcile reporting data between CISCO'S legacy Oracle BI tool and its new SAP HANA based BI initiatives
Confidential
Business Analyst
Responsibilities:
- Fabricated a business process for analyzing and processing audit data and financial data of housing societies; reported to teh manager teh key findings in teh process
- Interacted with clients to define KPI's; built dashboards using Tableau for analyzing and summarizing fiscal data related to KPI's
- Identified issues and areas for improvement, provided creative alternatives to improve business process efficiency by 30%
- Developed a single layer neural network model using pandas, numpy, scikit-learn, statsmodels, back-propagation & gradient descent
- Performed dataset cleaning and processing; used L2 regularization and dropout methods to overcome overfitting