Big Data Architect Resume
SUMMARY:
Extremely large scale Big Data, databases and data warehouses. Broad domain knowledge. Large scale distributed systems. Knowledge of all aspects of database technology, from hardware to tuning to modeling.
EXPERIENCE:
Big Data Architect
Confidential
Responsibilities:
- Developed a cloud platform base that makes very large data workloads run faster and cost less without changing the software layered on top of it.
- Did research and POCs on CUDA GPU technology for Big Data Applications.
- Some Deep Learning.
- Hortonworks, Hive, Spark, HBase, Phoenix in an on-premises environment
- EMR, Hive, Spark, HBase, Phoenix in an AWS environment with EC2s
Environment: Python, PySpark, Pandas, SQL, Java, Numba, NumPy, TensorFlow, PyCUDA
Note: this work has been going on and off for over 3 years.
Big Data Architect
Confidential
Responsibilities:
- Provided expertise to this cloud consulting firm.
- The main emphasis was Redshift, RDS and assorted AWS technologies.
- Built many small POCs and production starter systems.
- Performance tuning and rearchitecting for existing deployments.
- AWS technologies, Redshift, RDS, S3, Glacier, Dynamo
- Lambda, Kinesis, Pipeline, EMR, Hadoop, Spark
- Hortonworks Hive, Phoenix and HBase
- Python, Java
Big Data Architect
Confidential
Responsibilities:
- Re-architected a Redshift and AWS implementation with various technologies.
- Media asset application.
- Provided Big Data and general distributed system technology, design/development and expertise.
- This was an AWS cloud implementation with heavy duty Redshift using the technologies below.
- Redshift, queues, tuning, scaling, 100 TB implementation
- AWS Redshift, Data Pipeline, SQL, Loading, Compression/Encoding
- AWS administration, Hardware architecture and scaling
- VPC, S3, EC2, CloudWatch, RDS, Lambda, EMR
- ParAccel, Elasticsearch, Cassandra, Aurora
- Queries for BI, Tableau, Mentoring, Python, Java
Big Data Architect
Confidential
Responsibilities:
- Architected and implemented systems used in the Confidential and Poors financials analytics.
- Business analysis, Architecture, modeling, ETL, complex analytics queries, Technology selection, Hardware architecture.
- Pivotal Analytics, HAWQ, Spark
- Hortonworks, HBase, Phoenix, Hive
- Hortonworks, AWS, EC2, HDFS, MapR, Tez, Oozie, Talend ETL/ESB, MLlib, Python, Java
Big Data Architect
Confidential
Responsibilities:
- Created reference implementations (RI) for starter projects
- SQL Server to AWS Redshift data warehouse, with Cassandra data sources as well. EC2, RDS, Redshift.
- SQL Server to Cassandra port on Amazon EC2 and RDS SQL Server with Talend
- Amazon Redshift using MicroStrategy analytics and Talend
- Dimensional Big Data Warehouse using Cassandra, Pentaho and Tableau
- Python, Java
Big Data Architect
Confidential
Responsibilities:
- Provided Big Data expertise and strategy
- Developed architecture and applications to meet project needs
- Graph database design
- High Availability and Disaster Recovery Sites
- System tuning, Distributed Server Architectures, Cloud Implementation
- Java programming. Web services, JavaScript, Heavy ETL
Environment: Neo4J, NoSQL, MySQL, SQL Server, Pentaho, Linux, WebMethods, TrueCrypt, Layer 7, Tomcat, Cassandra, Hortonworks, Hadoop, Hive, Pig, Java, Python
Big Data Architect
Confidential
Responsibilities:
- Provided Big Data expertise to a service organization trying to engage Big Data services.
- Built proof of concept prototypes with various technologies.
- Provided industry solutions insight.
- Provided pre-sales support.
- Advised clients on Big Data options, technology and provided solutions.
- NoSQL, Dynamo, Bigtable, Hadoop, Hive, Pig, Mongo, HBase, ZooKeeper, Redis, Cassandra, HCatalog
Environment: CouchDB, Neo4J, HyperTable, Oozie, Sqoop, Ambari, Flume, Tableau, Pentaho, Hortonworks, Cloudera
Data Architect
Confidential
Responsibilities:
- Designed, built and implemented a staging and cleaning area for a third-party analytics package
- De-duplicated different data systems, cleaned data and rekeyed the attached data.
- Large scale data from Salesforce, Convio, Magento, ClearConnect and various MySQL databases.
- Use of modeling tools, Talend ETL and Talend Data Quality.
- Heavy SQL, Java (for Talend) and Java development of Web Services XML/SOAP interfaces to access APIs on cloud-based systems. Use of Amazon RDS and other cloud environments.
Data Architect
Confidential
Responsibilities:
- Provided technical guidance and fixed project issues for several offshore contract software companies.
- Created and provided Talend template ETL jobs prepackaged with VisualCron and a generic dimensional data warehouse.
- De-duplicated two different data systems and rekeyed the attached data.
- Oracle, PL/SQL, SQL Server, T-SQL, SSIS, SSRS, SSAS, Oracle Developer, Talend
BI/Data Warehouse Consultant
Confidential
Responsibilities:
- Responsible for the design and implementation of a data integration database, data warehouse and BI portal.
- Dimensional data warehouse design, star schema.
- Design and implementation of database ETL jobs using Talend.
- Oracle Database and SQL Server.
- Data migration from legacy systems
Environment: Oracle, PL/SQL, SQL Server, T-SQL, SSIS, Stored Procedures, ER/Studio, Toad Data Modeler, Oracle Developer, Talend, MDM, Oracle Data Quality (Datanomic)
BI/Data Architect
Confidential
Responsibilities:
- Responsible for the design and implementation of a dimensional data warehouse.
- Redesigned existing report database and analytics with lower cost solutions.
Environment: MySQL, Talend, Jasper, ModelRight, SQL Server, Cognos, SSIS, SSRS, SSAS
BI/Data Architect
Confidential
Responsibilities:
- Responsible for the design and implementation of a workflow system.
- Developed complex SQL and was responsible for design of data portions of the system.
- Also performed C#/.NET coding: SOA, AOP, Dependency Injection, IoC.
Environment: Oracle, PL/SQL, Stored Procedures, iBatis, Spring, jQuery, log4net, WCF, Visual Studio