marciogualtieri/About

About

Welcome to my GitHub. In this document you will find my most relevant projects (skill-wise) highlighted. Feel free to explore all my public repositories, but bear in mind that any projects not mentioned here are likely a work in progress and thus may be incomplete.

Programming

I follow clean code principles, including TDD (I write tests first) and the SOLID principles. I also practice BDD, using Behave for Python, JBehave for Java, and ScalaTest's FunSpec for Scala.

The following samples of my work represent some of my coding skills:

Data Engineering

  • Universities Data Pipeline: An example of using Apache Airflow to create a data pipeline that persists data from input files into a database.

  • Data Ingestion: IoT Simulator that generates JSON data and publishes it to a Kafka topic.

  • Data Transformation: Spark Streaming job that consumes data from a Kafka topic and persists it to HBase.

  • Data Analysis: Database scripts that build an Impala table on top of an HBase table and run a few queries against it.
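
As a rough illustration of the ingestion step listed above (this is not the code from the Data Ingestion project itself), here is a minimal sketch of a simulator that generates JSON readings and hands them to a publisher. The names `generate_readings` and `publish_all`, the message fields, and the value ranges are all assumptions made for this example. The `publish` callable stands in for a real Kafka producer, so the sketch runs without a broker.

```python
import json
import random
import time
from typing import Callable, Dict, Iterator, List


def generate_readings(device_ids: List[str], count: int, seed=None) -> Iterator[Dict]:
    """Yield `count` simulated readings (hypothetical message schema)."""
    rng = random.Random(seed)
    for _ in range(count):
        yield {
            "device_id": rng.choice(device_ids),
            "temperature_c": round(rng.uniform(15.0, 35.0), 2),
            "timestamp": time.time(),
        }


def publish_all(readings, publish: Callable[[bytes], None]) -> int:
    """Serialize each reading to UTF-8 JSON and pass it to `publish`.

    `publish` is a placeholder for whatever sends bytes to a Kafka topic.
    Returns the number of messages handed to it.
    """
    sent = 0
    for reading in readings:
        publish(json.dumps(reading).encode("utf-8"))
        sent += 1
    return sent


if __name__ == "__main__":
    # Print instead of publishing, since no broker is assumed here.
    publish_all(
        generate_readings(["sensor-1", "sensor-2"], 3, seed=42),
        lambda message: print(message.decode("utf-8")),
    )
```

Keeping the publisher as a plain callable makes the generator easy to test first and lets a real producer be plugged in later.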

Data Science

The following samples of my work represent some of my data science skills:

Technical Articles

  • Back-end Design Using AWS Serverless: A case study of using AWS serverless services to implement a back-end. It uses Cognito, API Gateway, AWS Lambda, SQS, RDS, Aurora, AWS Batch, CloudWatch, SES, and IoT Core.

Certified Training

I have completed a number of courses from Coursera and edX on the subjects of machine learning, statistics and data analytics (using tools such as Spark, R and Python). You will find a list of my certified courses on my LinkedIn profile.

The following are some highlights of my certified training:
