Sr Data Analyst Resume
Sfo, CA
SUMMARY
- Over 7 years of experience in Data analysis, Predictive modeling, Machine learning, Statistical analysis with R and SAS programming.
- Proficient in advanced statistics - Analysis of Variance (ANOVA), Linear Regression, Logistic Regression, Multivariate Analysis and Statistical Modeling.
- Expert in data mining, exploratory data analysis and Machine Learning (Supervised and Unsupervised) algorithms, K means clustering, Random forests, Decision tree, SVM, PCA, Time-series, Regression and Clustering.
- Well-versed in R Programming using several CRAN packages and data visualization.
- Strong practical experience in Python programming, including statistical libraries such as NumPy, SciPy, pandas, Matplotlib, scikit-learn and seaborn.
- Skilled in providing analytic support including data importing, data wrangling and data visualization.
- Able to communicate effectively with multifunctional teams, programmers and technical staff at all levels. Strong customer service orientation.
- Participated in requirements analysis reviews and design sessions to understand the requirements and design reporting solutions.
- Committed Team player with excellent communication skills and capable of working independently.
- Familiar with fundamentals of Hadoop Architecture, HDFS Framework and components of its ecosystem like Map Reduce, Hive, Sqoop and Spark.
TECHNICAL SKILLS
- R
- SAS
- Python
- Hadoop
- HDFS
- MapReduce
- Hive
- Sqoop
- Apache Spark
- Scala
- NoSQL
- ASP.NET
- MS Excel
PROFESSIONAL EXPERIENCE:
Sr Data Analyst
Confidential, SFO, CA
Responsibilities:
- Calculated risk factor for individual clients based on hierarchical demographical information.
- Performed customer analysis, risk and pricing analysis and forecasted results for credit card holders on demographical basis.
- Imported data into suitable data structures, and carried out cleaning and manipulation using R packages such as tidyr, dplyr and reshape to create formats as per business requirements.
- Carried out exploratory data analysis and prepared graphs using the modified tables.
- Performed factor analysis and Principal Component analysis to reduce the variables and performed cluster analysis.
- Worked extensively on building logistic regression, classification and regression trees, and random forest algorithms for calculating probability of default(PD).
- Proficient in using Spark to load data from the local file system and from relational and NoSQL databases, querying it with Spark SQL, importing data into RDDs, and ingesting data from a range of sources using Spark Streaming.
- Generated Reports, Summary tables, Charts and Graphs for different users using R packages.
- Well-versed in data visualization tools: ggplot2, leaflet, geosphere, Tableau and D3.
Environment: R, RStudio, Apache Spark, CRAN packages, ggplot2, regression, random forest, decision trees, cross-validation, bootstrapping.
Sr Data Analyst
Confidential, Charlotte, NC
Responsibilities:
- Performed data cleaning, sorting, merging, data analysis and summarization using SQL, SAS base programming in batch mode.
- Read data from Excel, Oracle and raw text files and converted it to SAS datasets, wrote output to flat files, Oracle tables and Excel files, and generated reports in RTF and HTML formats using DATA _NULL_, SAS/ODS and PROC REPORT.
- Worked extensively in data analysis, data mining of raw data, excel data and creating various reports from data marts used in forecasting the response rate.
- Applied descriptive statistics (mean, median, mode, data distributions, standard deviation and variance), hypothesis testing (p-values) and tests for significance (z-test, t-test and ANOVA).
- Extensively used Proc SQL, Proc DBLOAD, Proc Tabulate, Proc Report, Proc Sort, Proc Freq, Proc Transpose, Proc Summary, Proc Means, Proc Reg, Proc ANOVA, Proc Univariate and DATA _NULL_.
- Built predictive data models in SAS using Linear, Logistic Regression & Time Series forecasting.
Environment: Base SAS 9.2, SAS/Macro, SAS/GRAPH, SAS/STAT, SAS/SQL, SAS/CONNECT, SQL, MySQL, Excel
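Illustrative sketch (not code from this role): the significance testing listed above was done in SAS, but the idea behind a two-sample z-test can be shown in a few lines of Python using only the standard library. The function name `two_sample_z_test` and the response-rate figures are hypothetical, and the samples are kept small for readability even though the normal approximation is only reasonable for large samples.

```python
import math
from statistics import NormalDist, mean, variance


def two_sample_z_test(sample_a, sample_b):
    """Return (z, two-sided p-value) for the difference in means.

    Uses sample variances with a normal approximation, which is
    only reasonable for large samples.
    """
    standard_error = math.sqrt(
        variance(sample_a) / len(sample_a) + variance(sample_b) / len(sample_b)
    )
    z = (mean(sample_a) - mean(sample_b)) / standard_error
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical response rates (percent) for two mailing groups.
control = [2.1, 2.4, 1.9, 2.2, 2.0, 2.3, 2.1, 2.2]
treated = [2.6, 2.9, 2.7, 3.0, 2.8, 2.5, 2.9, 2.7]

z, p = two_sample_z_test(treated, control)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A small p-value here would suggest that the difference in mean response rate is unlikely to be due to chance alone, under the stated assumptions.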
Data Analyst
Confidential, San Francisco, CA
Responsibilities:
- Performed data manipulation and prepared the training and testing sets for modeling.
- Used matplotlib to explore the dataset and to check skewness in the data.
- Removed the skewness and performed normalization of data.
- Performed PCA (principal component analysis) to identify the independent variables most strongly related to the target variable.
- Wrote simple and advanced SQL queries for extracting data and created dashboards and stories.
- Utilized libraries such as xgboost and scikit-learn to build statistical models such as Ridge Regression, Lasso Regression, XGBoost Regression and Random Forest.
- Trained the models on the training dataset and evaluated root mean square error and accuracy scores to select the best statistical model.
- Performed prediction of house prices on the testing dataset.
Environment: R, Python, Jupyter, Microsoft Office, Microsoft Excel.
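Illustrative sketch (not code from this role): the skewness check, skew reduction and RMSE-based model selection described above can be outlined in standard-library Python. All prices, predictions and model names below are hypothetical, and a log transform is assumed as the skew-reduction step since the original does not name one.

```python
import math
from statistics import mean, pstdev


def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    mu, sigma = mean(values), pstdev(values)
    return sum(((v - mu) / sigma) ** 3 for v in values) / len(values)


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))


# Hypothetical right-skewed sale prices (in thousands).
prices = [100, 120, 130, 150, 160, 200, 900]
log_prices = [math.log1p(p) for p in prices]  # log transform reduces skew
print(f"skew before: {skewness(prices):.2f}, after: {skewness(log_prices):.2f}")

# Hypothetical held-out targets and predictions from two candidate models.
actual = [210.0, 340.0, 150.0, 420.0]
predictions = {
    "ridge": [200.0, 360.0, 170.0, 400.0],
    "random_forest": [205.0, 345.0, 155.0, 410.0],
}

# Score each candidate and keep the one with the lowest RMSE.
scores = {name: rmse(actual, preds) for name, preds in predictions.items()}
best_model = min(scores, key=scores.get)
print(scores, best_model)
```

In practice the predictions would come from fitted scikit-learn or xgboost models rather than hard-coded lists.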
Data Analyst
Confidential, Peoria, IL
Responsibilities:
- Developed and analyzed strategies and reports for use in existing customer management, marketing campaigns and collections.
- Performed customer analytics to make recommendations and identify the best offers for the monthly mailing program using profiling and clustering techniques.
- Prepared data through variable transformations, missing value imputation, outlier treatment and multicollinearity checks.
- Conducted in-depth analysis, mining and interpretation of results from various datasets to study customer and product behaviour patterns.
- Created various summary reports using Proc Report and Proc Tabulate and published them to a website via ODS HTML.
Environment: Advanced Excel (Macros, Pivot tables, graphs), SQL, SAS Reports.
Jr .NET Developer
Confidential
Responsibilities:
- Reviewed requirements and design docs to ensure adherence to the guidelines defined by the client.
- Designed mockup screens to demonstrate business users' requirements.
- Worked extensively on Microsoft SharePoint designer to modify the master page and to differentiate design, look and feel of every sub site under the home site.
- Involved in developing custom Microsoft components using C# and ASP.NET.
- Designed and developed the User Interfaces, screen layouts using JavaScript, CSS, and HTML.
- Involved in maintaining versions of source code using Team Foundation Server (TFS).
- Used Microsoft Application blocks for Data Access, Exception Handling and Logging.
- Performed unit testing and took part in integration testing.
- Debugged code, fixed development and QA bugs, and modified code to incorporate enhancements and customizations.