Hello, I'm
Data Scientist — LLMs · NLP · RAG · GenAI
I transform data into actionable insights and build production-ready AI systems. Currently working at SiDi and pursuing a Master's in Computer Science — passionate about bridging research and real-world impact.
I'm a Data Scientist at SiDi and a Master's student in Computer Science, based in Recife, Brazil. My work spans Generative AI architectures, Natural Language Processing, and Data Analytics — with a focus on delivering measurable value across industries.
I've collaborated with diverse teams, turning complex datasets into clear decisions. I'm particularly drawn to problems that sit at the boundary between research and production: where rigour meets usability.
Outside of code, I believe in active listening, proactive problem solving, and the kind of teamwork that makes hard things feel simple.
SiDi · Hybrid · Recife, Brazil
Leading data science initiatives with a focus on LLM integration and Generative AI pipelines. Collaborating across business units to translate complex requirements into scalable, production-ready data solutions.
SiDi · Hybrid · Recife, Brazil
Developed and maintained ML pipelines and data analytics workflows. Contributed to RAG and NLP projects, gaining hands-on experience across the full data science lifecycle in an enterprise environment.
SiDi · On-site · Recife, Brazil
Handled end-to-end data workflows including data cleaning, requirements analysis, and exploratory analytics. Built a foundation in applied data science within a large-scale tech company.
CESAR School · Remote · Recife, Brazil
Provided academic support for students in Algorithms and Data Structures. Covered core topics in C and Python, reinforcing problem-solving skills and algorithm design fundamentals.
CESAR School · Remote · Recife, Brazil
Supported students through requirements elicitation and validation processes. Evaluated deliverables using structured evaluation metrics, developing strong communication and analytical skills.
CESAR TechTrends · Remote · Recife, Brazil
Conducted research in the scientific initiation program, studying Deep Learning algorithms including MetaProd2Vec for embeddings and e-commerce product vectorization. Developed autonomous data engineering and technical research skills.
CESAR · Hybrid · Recife, Brazil
Worked as programmer and designer in CESAR's 6-week innovation program (Summer Job), in partnership with Komatsu. Applied systematic thinking and a user-centred mindset to solve real operational challenges under tight deadlines.
Proof of concept applying LLMs to personal finance management — demonstrating how language models can interpret and reason about financial data in a conversational interface.
A Query-to-SQL RAG system for consulting tabular data through natural language. Includes a UI layer, making structured databases accessible without writing a single SQL query by hand.
An evaluation framework for LLM outputs — built to benchmark and compare model responses using automated judging criteria, supporting more rigorous model assessment workflows.
Real-time circular Chladni plate simulation driven by live audio input — a generative visualisation of acoustic resonance patterns using Python and OpenGL.
Final project for the Computer Science degree — a culminating research essay combining analytical rigour with applied technical work, documented as a public Jupyter Notebook.
An interactive data science dashboard built with Streamlit and deployed publicly — demonstrating data visualisation and analytics in a clean, shareable format.
Loading posts…
Whether it's a project, a question, or just a hello — I'm open to it.