Advanced Analytics Engineer Resume
Charlotte, NC
SKILLS:
Software:
Languages: Java, Python, Apache Spark, PySpark, MatPlotLib, Seaborn, Tableau
Databases: MySQL, Solr (NoSQL)
ML / NLP Packages: TensorFlow, Keras, Stanford CoreNLP, NLTK, IBM Watson
Data Wrangling: Pandas, Numpy
CI / CD: Maven, Git, Jenkins
PROgram Management & Quality assurance: Software Release Cycle Management, Risk Management for Product Releases, Stakeholder Communication, Agile Software Development, QA Scoping and Project Planning; JIRA, Test Automation Framework, Hiring, Training, and Mentoring of QA Staff, OEM transactions
PROFESSIONAL HISTORY:
Confidential, Charlotte, NC
ADVANCED ANALYTICS ENGINEER
Responsibilities:
- Responsible for bringing Data Science & Machine Learning models into production on Big Data Apache Spark cluster - computing framework.
- The models are spread across various Lines of Business such as Commercial & Investment Banking, Risk Management et cetera.
- Reverse Engineer Data Science models in Python Jupyter Notebooks on Apache Spark to a) identify potential compute resources bottlenecks for production deployments, and b) gain in-depth understanding of Sci-kit Learn and H2O based algorithms used for given model.
- Leverage understanding of internals of DS & ML models to bridge the knowledge gap between Data Scientists and various other groups such as Data Pipeline Engineers, Business Analysts, concerned LoB departments by documenting Functional System Design of these models.
- Left Storage Software industry to transition into AI / ML domain as a self-starter.
- Developed a full-stack platform for NLP and quantitative analysis for opportunities in financial markets based on Recurrent Neural Networks, Supervised Machine Learning, and Confidential .
- Specifically, the platform accounts for following services:
Confidential
Software EngineerResponsibilities:
- A search facility for financial returns of public companies identifying important entities and keywords across filings.
- The pipeline starts with scraping SEC EDGAR to download data (BeautifulSoup, Jaunt) and then leverages combination of Confidential, GenSim (Python) and Stanford CoreNLP (Java) to clean, tokenize, and generate information on lemma, Part of Speech, NER, sentiment analysis, Word2Vec, and topic modeling.
- This tokenized dataset is further pushed to NoSQL (Solr) and can be searched (Lucene) from a User Interface (Django).
- TensorFlow & Keras based algorithms (experimental) to predict prices of equities based on regression. Core data is expanded to further generate technical indicators such as Exponential Moving Averages, Bollinger Bands, Relative Strength Index and others to increase input dataset before being fed into a Recurrent and Convolutional Neural Networks for anticipated price movements. Basic ML techniques such as Closed Form, Gradient Descent, Decision Tree, Word2Vec, Ridge, Lasso, Random Forest, Ensemble techniques using Pandas & NumPy in Python are also coded.
Confidential, Woodcliff Lake, NJ
Senior Manager
Responsibilities:
- Catalogic ECX, available as IBM Spectrum Protect Plus, allows IT infrastructure and services teams to automate recoveries of production data on various enterprise storage platforms.
- Other than data recovery, ECX also provides services such as DevOps, Disaster Recovery, Test/Dev, Storage Self Service, and Analytics.
- Led and delivered ECX release cycles including all delivery related interactions with IBM counterparts for OEM transactions.
- Planned, monitored, and tracked risk exposure & potential delays ensuring timely and effective communication of project status in terms of cost, schedule, and risk assessment.
- Managed a team of QA specialists consisting of up to 6 on-shore and 19 off-shore engineers delivering 25+ major / service pack releases across two product lines in parallel.
- End to end responsible for managing high-quality and on-time release cycles every couple of months for ECX and DPX product lines with progress indication in dashboards, prioritization, and escalation handing of bugs.
- Contributed in development of Test Automation Framework for validation of RESTful API code-base.
Confidential, Woodcliff Lake, NJ |
Senior Software Engineer & Technical LEAD
Responsibilities:
- Technical Lead & Subject Matter Expert on advance disk-based data backup solution maintaining 90%+ customer satisfaction and ~80% contract renewal rates in the Support group over a period of 7+ years in a continuous 24*7*365 support cycle.
- Took ownership and consolidated critical customer accounts with credible threat of defection managing dialogue with core Engineering group on escalations in the process guiding other engineers to resolve customer issues.
- Established an advisory role on customers' upcoming infrastructure changes and data protection needs and oversaw regular conference calls on open issues and escalations.
- Received and resolved technically complex, mission critical, and politically hot escalation tickets maintaining ownership of issues until resolution.
- Maintained patch releases and enhancements for current Software release establishing a bridge between Technical Support and Core Engineering group.