We provide IT Staff Augmentation Services!

Contenthubengineer Resume

5.00/5 (Submit Your Rating)

New York New, YorK

TECHNICAL SKILLS

Programming/Scripting: Java, C, C++, C#, LUA, Python, JavaScript, TypeScript, OCaml, PL/SQL, Shell/Bash, Scala

Web Development: HTML5/CSS3/JS, REST Web Services, Microservice Architecting, Spring 3/4/5, Spring Boot 2, Security, Spring Data JPA, AngularJS, Angular, Node.js – npm, cluster, Express.js, Bootstrap, Hydra Express, Python Flask, WebSocket, Play Framework

Databases: MySQL, Oracle, Microsoft SQL, MongoDB, SQLite, PostgreSQL, DynamoDB, Carmot (Proprietary HBase & Cassandra Implementation)

Version Control/Project Tools: GIT, SVN, Trello, JIRA (Scrum + Kanban), Maven, Gradle

Operating Systems: OS X, Unix, Linux (CentOS, Arch Linux), Windows

Data: Hadoop, Apache Spark, GeoSpark, Apache Kafka, Apache Bolt, Celery

Development Tools: IntelliJ IDEA, Visual Studio, Xamarin Studio, Xcode, Netbeans, Atom.io, Postman, Eclipse, PyCharm, Android Studio, PHPStorm, Datagrip

Other Technologies/Frameworks: JPA/ORM – Hibernate, Jackson, swagger, Log4J, SLF4J, Spring Security, JWT Authentication, Apache MINA, Java RMI, .NET Web API, Remoting, JUnit, Mocha, Jongo, JDBC, cluster, Redis, Google APIs, Axios, Geographic Information Systems, ArcGIS, PostGIS, pgAdmin, Zend. Splunk, Idiorm, OkHttp, Guava, Screwdriver, Jenkins, Athenz X.509, Logstash, Opentracing, SQLAlchemy, Apache Traffic Server, Yamas, Jaeger, Zipkin, APM, Guice, Caffine, Lombok, Vespa, Hamcrest, SonarQube, Mockito, Apache Pulsar,

Cloud: Amazon Web Services (AWS) EMR, S3, EC2, RDS, CloudFront, DocumentDB, SQS, SNS, Lambda, DynamoDB, Elastic Map Reduce, CodeDeploy, Terraform, Google Cloud Services (GCP), Heroku, Docker, Kubernetes, Istio, Envoy mTLS, Firebase Realtime, Crashlytics, Monitoring, Firestore


PROFESSIONAL EXPERIENCE

Content Hub Engineer

Confidential - New York, New York

Responsibilities:

  • Part of the team (mostly ex-AOL & ex-Yahoo!) mainly responsible for the infrastructure and business applications to ingest partner feeds and in- house content into a single data warehouse (HBase + Cassandra implementation) using a common canonical model (CCM) and apply enrichments on ingested content.
  • Worked on the editorial tools used by Yahoo, AOL, Huffpost, TechCrunch, Engadget and more.
  • Migrated all the topologies and services owned by the Content Processing Team from Bouncer (Legacy S2S Authentication) to Athenz X.509
  • Built Autowirable Spring Boot components and published them to the artifactory for services to be built with modularity. i.e. importing aws-auth, athenz-provider, ccm-io components.
  • Migrated all build pipelines from Screwdriverv3 to Screwdriverv4
  • Implemented logstash-logback encoding with Jaeger Opentracing to create spans & trailIds to trace any single process across the distributed ingestion & processing pipeline.
  • Feed ingestion with Python and Celery to chain tasks depending on the feed configuration & ingestion strategy.
  • Created AWS Lambda functions that listen to SNS notifications and update DynamoDB entries, call other services and fire SQS messages.
  • Considerable Ops work like setting up Ingress’s and mTLS authentication using Istio & Envoy sidecar proxies + setting up ATS (Apache Traffic Server) remap rules.
  • Built a notification system using SNS for when content gets updated to replace a legacy poll-based change log.
  • Developed Apache Bolt processes that ingested XML feeds from hundreds of partners and transformed them into the Common Content Model.
  • Drove the initiative for better logging and distributed tracing across the services that our team maintained, so much so that other teams instrumented their apps and onboarded to our Jaeger cluster.
  • Implemented Opentracing instrumentation from scratch inside of Scala’s Play 2.3 framework

Environment: - Java 8/11, Apache Bolt, Carmot (HBase & Cassandra), Python, Celery, SQLAlchemy, PHP, AngularJS, MySQL, Screwdriver, Athenz, Spring B oot, JavaScript, A m azon S3, SQ S, SN S, Lam bda, RD S, D ynam oD B, M aven, G radle, D ocker, JIRA, N ode.js, JFrog, Kubernetes, Splunk, Kibana, Elasticsearch, Logstash, SLF4J-Logback, Opentracing, XSLT, Apache Traffic Server, Jackson, Scala, Play Framework, Jaeger, Jenkins, Apache Pulsar

Software Engineer

Confidential

Responsibilities:

 
  • Involved in all stages of the startup phase & application lifecycle, focusing on algorithm implementation and developing cluster ETL routines.

  • Collaborated with other developers to integrate user-facing elements with server-side logic.
  • Built reusable code and managed cutting-edge technologies.
  • Built & implemented algorithms for valuating real estate properties based on a number of variables & exposed them via microservice endpoints.
  • Developed Apache Spark applications that scrubbed and cleaned taxation + parcel data provided by municipal, county and state governments.
  • Used GeoSpark, an unofficial extension to the spark framework for cluster computing framework for processing large-scale spatial data.
  • Batch processing + algorithm implementation on gigabytes of non-uniform CSV’s filled with clerical errors and corrupt data.
  • Map-reduced spatial joining of multiline vector polygon data with single point vectors, and then again with the uniformized CSV data.
  • Using GeoSpark to perform map-reduced geo-spatial manipulation of vector data, increased the amount of data entries we were able to validate from 60% to more than 90%.
  • Designed a data-pipeline where Amazon S3 data nodes triggered Amazon EMR Clusters that in turn updated production data.

Environment: - Java 8, JavaScript, Spring Boot – Security – Data, Hibernate, PostgreSQL, PostGIS, Apache Spark (Java), GeoSpark, ArcGIS, Google M aps A PI, A m azon S3, A m azon EMR, A m azon EC 2, A pache M aven, D ocker, B itbucket, JIR A, N ode.js, H ydra -Express, Redis, M ongoD B, A ngular, Bootstrap, HTML, CSS, JavaScript, Trello, JIRA

We'd love your feedback!