We provide IT Staff Augmentation Services!

Data Scientist Resume

4.00/5 (Submit Your Rating)

Charlotte, NC

SUMMARY:

  • Statistician/Data Scientist/Applied Mathematician: extensive experience in applied mathematics and statistics applications. Expert in SAS, MATLAB, R, S - PLUS, S, Confidential, BUGS, and Maple. Some work in Pharsight Phoenix, Monolix, and additional, domain-specific analytics tools. Experience in analytics for fraud, Comprehensive Capital Analysis and Review ( Confidential ) regulations in banking, bank-owned life insurance ( Confidential ), pharmacometrics, proteomics, genomics, virology, psychometrics, and Comparative Effectiveness Research (CER).
  • Database Architect/Software Architect/Developer: designer and developer of large database systems for analytics applications in the pharmaceutical and financial services industries. An experienced Oracle and DB2 developer; proficient in Hadoop/Hive; has recently been using Teradata Aster. As a systems architect, makes use of both relational and non-relational (e.g., Hadoop- and document-centric) models of data storage as part of an overall Big Data 2.0 approach. As a database modeler, uses entity-relationship models, UML-based object and data models, and dimensional analysis models. Aligns those with use of formal methods. Currently writing up discussions of formally defined Big Data 2.0 approaches.
  • Languages and Tools: SAS, MATLAB, Oracle PL/SQL, PERL, C/C++, Hadoop, Hive, Sqoop, Pig, Crystal Reports, ColdFusion, Visual Basic, Crystal Reports, Active Server Pages, Visual Studio 1.0 - Microsoft.Net, R, Confidential, S, S-PLUS, Maple, SAS MarketMax, CORBA, Microsoft ActiveX/COM/.Net, DCE, BPEL, and various Service Oriented Architecture (SOA ) APIs. Fluent in use of SmartDraw, ERwin, Visio, business process modeling tools such as ActiveVOS, Business Process Modeling Notation (BPMN), and formal specification and programming tools to include Z, Common Algebraic Specification Language (CASL), and Functorial Programming and Query Language ( FPQL ).

PROFESSIONAL EXPERIENCE:

Confidential, Charlotte, NC

Data scientist

Responsibilities:

  • Define technical architecture and statistical methods for fraud analytics, making use of a Big Data 2.0 model in general.
  • Using Teradata Aster and Extract, Load, and Transform (ELT) methods implemented under SAS Enterprise Miner (EM). For statistical work, using Teradata Aster routines, base SAS, Actimize, R, Bayesian models in R/STAN, and -- in early 2017 -- what is expected to be SAS 10.

Confidential, Greensboro, NC

Data scientist/Statistician

Responsibilities:

  • Developed company’s first truly statistically informed mortality models for its $38 billion in Confidential policies.
  • Wrote statistical analysis plan involving use of plotting in R, and R packages for Confidential, Generalized Linear Models (GLMs), and other statistical work.
  • Designed new set of data warehousing components for Oracle system largely written by me in 1999-2003. Implement those using a model driven architecture (MDA), PERL, and PL/SQL.
  • Added set of OLAP data cubes to speed up rollup processes by a factor of ten.
  • Defined a long-term plan pertaining to technology tools.
  • Architected and then developed Confidential ’s first system for $6 billion in insurance policies using an enterprise-wide DBMS. Migrated company from spreadsheets to system using Oracle.
  • Handled all UNIX and Oracle systems administration. System as a whole used Solaris, Oracle 8i, PERL, Apache, ColdFusion, and Crystal Reports.

Confidential, Houston, TX

Senior data scientist

Responsibilities:

  • Architected and implemented computational pipeline and analytics system for client’s first big data (Hadoop) system for customer retention and other models.
  • Estimated a 10 point in predictive ability using new methods. Use of random forests as measured by Confidential subsequently did then improve predictive ability from 75% to 85%.
  • Defined long-term recommendations for client involving approaches involving Hadoop, R, Oracle, and SAS.
  • Unfortunately, additional contracts for Confidential in the oil sector were expected due to price of oil falling by 50% and so I returned to Triangle.

Confidential, Winston Salem, NC

Senior data analyst/data scientist

Responsibilities:

  • Took over Confidential & Confidential ’s data quality (DQ) system for Comprehensive Capital Analysis and Review ( Confidential ) submissions. Moved Confidential & Confidential to use of automated programming methods for definition and use of DQ rules.
  • Developed approach to automated programming of data quality systems using algebraic set theory and semantics; approach roughly five times more efficient to use than other methods.

Confidential, RTP, NC

Architect and developer

Responsibilities:

  • Designed and built data warehouse for use in HIV, Hepatitis and other applications. Used Oracle PL/SQL, SAS, R, and PERL to create system of 100+ tables for this work.
  • Obtained data from many sources, e.g., by parsing PDF files, Excel spreadsheets, CSV files, and SAS datasets.
  • Using a Model Driven Architecture (MDA) approach, automatically generated sets of PL/SQL packages.
  • Customized reporting using R, R lattice graphics, Windows and UNIX scripting tools.
  • Did statistical analysis on genotypes and genotypes of in vitro and in vivo data involving HIV.
  • Served as member of CDISC committee on data standards in virology.

Technical lead

Confidential

Responsibilities:

  • Technical lead for clinical trials project involving pharmacokinetics and nonlinear mixed effects modeling using Confidential into FDA (21 CFR Part 11) compliance.
  • Developed system design for system implemented with a backend on Solaris or Linux and a front-end on Microsoft platforms using C#, Microsoft.Net, and Oracle.
  • Wrote specifications using UML, developed prototypes, and led technical team overall.

Confidential, Morrisville, NC

Senior Engineer: Architect and Developer

Responsibilities:

  • Defined architecture and design for Confidential, a data warehouse for use in replacing lower layer of Confidential ’s issue tracking system. Then, implemented 95% of that system.
  • Developed Confidential system as a generic “PERL-on-Oracle” system. Automated programming system generated Oracle packages, ETL procedures, etc.
  • Defined successful iterative approach to project management based upon a balance of formal and informal methods as suggested by the computer scientist Barry Boehm.
  • Wrote or automatically generated 300 pages of documentation; worked extensively with stakeholders and to determine needs and support end users.
  • Gave two posters at Population Analysis Group of Europe ( PAGE ) conference, Marseilles, France, 2008, “Open Statistical Services” and on databases for use in pharmacology.
  • Gave poster at algebraic statistics conference, Durham, NC, 2009 on formal models of data warehouses

Confidential, Cary, NC

Consultant

Responsibilities:

  • Implemented and determined effectiveness of Confidential microarray analysis that made use of the underlying biophysics, that is, models of partial hybridization of probes and targets.

Confidential, Acton, MA

Developer of insurance compliance systems

Responsibilities:

  • Over seven years, designed, wrote, and enhanced core modules of insurance-compliance systems of Confidential (now Confidential ). Primarily used Access, VBA, and SQLServer.
  • Used PERL for text mining of insurance text as well as other applications.

We'd love your feedback!