jackomazi/README.md

Hi there, I'm Andrea Giacomazzi!

MSc Student in Artificial Intelligence and Data Engineering at the University of Pisa.

Aspiring Data Engineer and Data Analyst with a strong interest in scalable data systems, machine learning, and data processing pipelines. I enjoy working on projects that combine software engineering with data-driven problem solving.

About Me

I am curious and growth-oriented, and I work to continuously improve my technical skills in data engineering and artificial intelligence. I am particularly interested in data pipelines, distributed systems, and machine learning applications.

Education

Master’s Degree in Artificial Intelligence and Data Engineering, University of Pisa (October 2025 – Present)

Bachelor’s Degree in AI and Data Analytics, University of Trieste (October 2021 – March 2025)

Technical Skills

Programming Languages

  • Python
  • SQL
  • PL/SQL
  • Java
  • C

Data Engineering & Big Data

  • Apache Spark
  • Oracle Database 19c
  • MySQL
  • ETL pipelines
  • Data Warehousing

Machine Learning & Data Analysis

  • Scikit-learn
  • Pandas
  • NumPy
  • Data cleaning and preprocessing
  • Statistical analysis

Tools & Platforms

  • Git (GitHub, GitLab)
  • Linux / Windows
  • VS Code, PyCharm, IntelliJ
  • Oracle SQL Developer
  • Office 365

Featured Projects

Text classification system for fake news detection using multiple machine learning models and performance comparison.

Distributed log analytics system using Hadoop MapReduce, Apache Spark, and a REST API for large-scale web server log processing and benchmarking.

Chess data analytics platform using MongoDB, Neo4j, and Spring Boot for large-scale data collection and analysis.

Interests

  • Data Engineering systems and architectures
  • Machine Learning applications
  • Scalable data pipelines
  • Distributed computing systems

Pinned

  1. chess-data-analytics (Public)

    Chess data analytics platform using MongoDB, Neo4j, and Spring Boot for large-scale data collection and analysis.

    Java

  2. fake-news-detector (Public)

    Text classification system for fake news detection using multiple machine learning models and performance comparison.

    Jupyter Notebook

  3. web-log-analytics (Public)

    Distributed log analytics system using Hadoop MapReduce, Apache Spark, and a REST API for large-scale web server log processing and benchmarking.

    Jupyter Notebook

  4. information_retrieval (Public)

    Jupyter Notebook

  5. Machine-Learning-Project (Public)

    Jupyter Notebook

  6. Programmazione-avanzata-e-parallela (Public)

    Python