About
I work on AI systems for cybersecurity, with a focus on LLM agents, evaluation…
Articles by Cristian
Activity
5K followers
Experience
Education
-
Columbia University in the City of New York
4.1/4.0
-
Activities and Societies: Executive Member of Columbia Data Science Society; Associate of Applied Analytics Club; Associate of Business Management Club
Relevant Coursework: Machine Learning for Finance, Cloud Computing (AWS), Managing Data, Storytelling with Data, Applied Analytics Frameworks and Methods, Research Design, Analytics and Leading Change, Applied Analytics in Organizational Context, Strategy & Analytics
-
-
-
-
-
Licenses & Certifications
Volunteer Experience
Publications
-
SIR-Bench: Evaluating Investigation Depth in Security Incident Response Agents
We present SIR-Bench, a benchmark of 794 test cases for evaluating autonomous security incident response agents that distinguishes genuine forensic investigation from alert parroting. Derived from 129 anonymized incident patterns with expert-validated ground truth, SIR-Bench measures not only whether agents reach correct triage decisions, but whether they discover novel evidence through active investigation. To construct SIR-Bench, we develop Once Upon A Threat (OUAT), a framework that replays real incident patterns in controlled cloud environments, producing authentic telemetry with measurable investigation outcomes. Our evaluation methodology introduces three complementary metrics: triage accuracy (M1), novel finding discovery (M2), and tool usage appropriateness (M3), assessed through an adversarial LLM-as-Judge that inverts the burden of proof -- requiring concrete forensic evidence to credit investigations. Evaluating our SIR agent on the benchmark demonstrates 97.1% true positive (TP) detection, 73.4% false positive (FP) rejection, and 5.67 novel key findings per case, establishing a baseline against which future investigation agents can be measured.
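The three metrics can be illustrated with a small aggregation sketch. This is not the benchmark's code; the field names and per-case structure are hypothetical, standing in for the verdicts an LLM-as-Judge might emit (it assumes both TP and FP cases are present):

```python
def aggregate_metrics(cases):
    """Aggregate hypothetical per-case judge verdicts into SIR-Bench-style
    headline numbers: TP detection rate, FP rejection rate, and mean novel
    findings per case."""
    tp_cases = [c for c in cases if c["label"] == "true_positive"]
    fp_cases = [c for c in cases if c["label"] == "false_positive"]
    # M1: triage accuracy, reported separately for TP and FP populations
    m1_tp = sum(c["triage_correct"] for c in tp_cases) / len(tp_cases)
    m1_fp = sum(c["triage_correct"] for c in fp_cases) / len(fp_cases)
    # M2: novel evidence discovered through active investigation
    m2 = sum(len(c["novel_findings"]) for c in cases) / len(cases)
    return {"tp_detection": m1_tp, "fp_rejection": m1_fp,
            "novel_findings_per_case": m2}
```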
-
Survey of Attention Mechanisms in Encoder-Only Language Models
The introduction of the Transformer architecture in 2017 catalyzed a fundamental paradigm shift in natural language processing (Vaswani et al., 2017). While decoder-only autoregressive models have come to dominate generative AI, encoder-only bidirectional models have historically established and maintained state-of-the-art results in natural language understanding, dense information retrieval, and sequence classification. This paper presents an exhaustive survey of the self-attention mechanism within encoder-only models. We analyze its mathematical foundations, trace its architectural evolution from the original BERT (Devlin et al., 2018) through RoBERTa (Liu et al., 2019), DeBERTa (He et al., 2021), and ModernBERT (Warner et al., 2024), evaluate efficiency enhancements including sparse attention (Zaheer et al., 2020), linear kernelized attention, and hardware-aware FlashAttention (Dao et al., 2022), and dissect the ongoing theoretical debates surrounding attention-based interpretability. We further survey hybrid architectures that interleave state-space models with self-attention, and discuss the practical limits and deployment considerations of encoder attention in long-context regimes. To complement this survey, we open-source an interactive visualization tool for exploring these architectures at https://github.com/cristianleoo/attention-in-encoders.
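For readers new to the topic, the core operation the survey analyzes — bidirectional scaled dot-product self-attention — can be sketched in a few lines of NumPy (a minimal single-head illustration, not the survey's code):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention (Vaswani et al., 2017).
    X: (seq_len, d_model); Wq/Wk project to d_k, Wv projects to d_v.
    Bidirectional (encoder-style): no causal mask is applied."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # (seq_len, seq_len)
    # Numerically stable row-wise softmax over the key dimension
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                                 # (seq_len, d_v)
```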
-
Geometric Concept Spaces in Small Encoders: A Comparative Mechanistic Probing of ModernBERT and DeBERTa-v3
Bidirectional transformer encoders have bifurcated into two optimization paradigms: topological precision via disentangled attention (DeBERTa-v3) and hardware-aware scaling via rotary positional embeddings (ModernBERT). This study presents an exhaustive geometric and mechanistic investigation of these architectures using 100,000 activation samples. Through linear probing, Centered Kernel Alignment (CKA), and intrinsic dimensionality estimation, we reveal a 16.5% performance gap in linear concept separability favoring DeBERTa-v3 (p < 0.001). We identify an extreme "Topological Collapse" in ModernBERT's final layers, where concept manifolds condense from 30 dimensions to 2. We quantify a fundamental stability-precision trade-off: ModernBERT's RoPE provides 4.3x higher local positional stability but induces severe semantic entanglement, while DeBERTa-v3 utilizes sparse, specialized sub-circuits to maintain precise orthogonal boundaries. Our findings provide a rigorous geometric explanation for the "token classification anomaly" in modern encoders.
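Of the probing tools mentioned, linear CKA is compact enough to sketch. Below is a minimal version following the standard linear-CKA formula (not the study's own code); a score of 1.0 means the two representations agree up to orthogonal transform and isotropic scaling:

```python
import numpy as np

def linear_cka(X, Y):
    """Linear Centered Kernel Alignment between two activation matrices of
    shape (n_samples, features). Invariant to rotation and uniform scaling."""
    X = X - X.mean(axis=0)          # center each feature
    Y = Y - Y.mean(axis=0)
    hsic = np.linalg.norm(Y.T @ X, "fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, "fro")
    norm_y = np.linalg.norm(Y.T @ Y, "fro")
    return hsic / (norm_x * norm_y)
```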
Courses
-
Applied Analytics in Organizational Context
APANPS 5100
-
Business Process Modeling
-
-
Cloud Computing
APANPS 5450
-
Financial Analysis
-
-
Frameworks & Methods
APANPS 5205
-
Managing Data
APANPS 5400
-
Negotiating in English
-
-
Persuading in English
-
-
Research Design
APANPS 5300
-
Social Media Management
-
-
Storytelling with Data
APANPS 5800
-
Strategy & Analytics
APANPS 5600
Projects
-
Scrap Metal Directional Price Prediction
-
This project focuses on analyzing financial news sentiment and using machine learning models to predict stock prices from that sentiment. It integrates data from various sources, including financial news articles, stock prices, economic indicators, and weather data. Sentiment analysis is performed with two models, FinBERT and GPT (Generative Pre-trained Transformer), and the price-prediction model employs the CatBoost algorithm.
-
Quant AI
-
As you're undoubtedly aware, the vast volume of news and rumors that emerge daily can be overwhelming, rendering it practically impossible for an individual to thoroughly process each piece of information.
To counter this challenge, we have integrated cutting-edge LLMs to develop an innovative application designed to assist users in comprehending market sentiment.
Our application harnesses the power of user-specified sources, processing and analyzing vast amounts of data with exceptional accuracy.
It leverages the capabilities of LSTM models, a type of recurrent neural network well-suited for sequence prediction problems, to predict market trends.
This integration of LLMs and LSTM models provides a robust and comprehensive solution to keep up with the pace of real-time information flow, resulting in a powerful tool for understanding and predicting market sentiment.
The ultimate goal is to empower our users to make informed decisions based on accurate, up-to-date, and predictive insights. (Initially built for Tribe AI Hackathon). -
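The LSTM component referenced above follows the standard cell equations; a minimal NumPy forward step (an illustration of the mechanism, not the project's implementation) looks like this:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM cell step. W: (4h, d), U: (4h, h), b: (4h,), with the four
    gate blocks stacked as [input gate, forget gate, cell candidate, output gate]."""
    z = W @ x + U @ h_prev + b
    h = h_prev.shape[0]
    i, f, g, o = z[:h], z[h:2*h], z[2*h:3*h], z[3*h:]
    c = sigmoid(f) * c_prev + sigmoid(i) * np.tanh(g)  # gated cell state update
    h_new = sigmoid(o) * np.tanh(c)                    # hidden state output
    return h_new, c
```

Iterating this step over a sequence of daily feature vectors, and reading a prediction off the final hidden state, is the essence of using an LSTM for market-trend sequence prediction.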
Prediction of Wild Blueberry Yield | Kaggle Competition | Top 1.5% Leaderboard
-
This data science project involves regression analysis using two models: LADRegression and LightGBM.
The project includes data preprocessing, feature engineering using Principal Component Analysis (PCA) and Partial Least Squares (PLS) regression, hyperparameter tuning using grid search, and model evaluation.
In particular, the project performs several predictions using different models, and then stacks them using Least Absolute Deviation (LAD) regression constrained to positive coefficients.
The goal is to develop accurate regression models and optimize their performance on the given dataset.
I created this notebook for a Kaggle competition, where I placed in the top 1.5% of the leaderboard. -
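The positively-constrained stacking step can be illustrated with projected gradient descent, a generic sketch of nonnegative blending rather than the notebook's exact code (which uses LAD loss; squared error is substituted here for brevity):

```python
import numpy as np

def stack_nonnegative(preds, y, lr=0.05, steps=2000):
    """Blend base-model predictions with nonnegative weights by projected
    gradient descent on mean squared error.
    preds: (n_samples, n_models) out-of-fold predictions; y: (n_samples,)."""
    w = np.full(preds.shape[1], 1.0 / preds.shape[1])  # start from equal blend
    for _ in range(steps):
        grad = preds.T @ (preds @ w - y) / len(y)
        w = np.clip(w - lr * grad, 0.0, None)          # project onto w >= 0
    return w
```

The nonnegativity constraint is the key design choice: it prevents the stacker from shorting one base model against another, which tends to generalize better on small tabular competitions.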
The Tinder Of Food
-
This project is an interactive Flask web application that displays a map of New York City and allows users to query it, along with a recommendation algorithm that matches suppliers to restaurants.
The application uses a combination of Python, HTML, CSS, and JavaScript. The data is stored using Apache Spark and MongoDB. -
LSTM to predict the number of weekly appointments for Columbia University
-
In this data science project I predicted the number of weekly appointments for Columbia University. The goal was to determine whether temporary staff needed to be hired based on the predicted appointment volume. To achieve this, I first performed feature engineering, analyzing changes in the characteristics of students over time to identify patterns and trends that could affect the number of appointments. I then trained a Long Short-Term Memory (LSTM) deep learning model on the historical appointment data to learn from those patterns and trends. Using the LSTM model, I was able to predict the weekly appointment volume for the next two months, information Columbia University can use to make informed decisions about hiring temporary staff and managing resources efficiently.
-
Predicting Tesla Stock from Elon Musk's tweets
-
In this data science project I predicted the fluctuation of Tesla stock from Elon Musk's tweets.
The goal of this project was to predict if the stock would decrease or increase based on the previous day's Elon Musk tweets.
To achieve this, I first performed data preprocessing on the tweets column, which involved removing URLs, stripping whitespace, removing non-alphabetic characters, stemming words, and creating a TF-IDF matrix.
Secondly, I performed exploratory analysis to visualize correlations between words and stock fluctuations, the top words used in the tweets, and so on.
Then I engineered features: I right-merged the Tesla stock dataset (imported via the Yahoo Finance API), grouped the tweets by day, concatenating all tweets from the same day, and extracted additional information such as the number of tweets per day, the average tweet length, the number of emoji used, and so on.
After that, I performed sentiment analysis with VADER, which provided useful sentiment scores.
Finally, I performed data modeling: I used an LSTM model on the stock data and a neural network on the NLP data, then combined their outputs with a second neural network ending in a single output layer. -
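The cleanup-then-TF-IDF step described above can be sketched in pure Python. This is a toy version mirroring the listed preprocessing (URL removal, lowercasing, non-alphabetic stripping), with stemming omitted for brevity:

```python
import math
import re
from collections import Counter

def tfidf(docs):
    """Build a toy TF-IDF matrix: strip URLs, lowercase, keep alphabetic
    tokens only, then weight term frequency by log inverse document frequency."""
    clean = [re.sub(r"http\S+", " ", d.lower()) for d in docs]
    tokens = [re.findall(r"[a-z]+", d) for d in clean]
    vocab = sorted({t for doc in tokens for t in doc})
    n = len(docs)
    df = Counter(t for doc in tokens for t in set(doc))
    idf = {t: math.log(n / df[t]) for t in vocab}
    matrix = []
    for doc in tokens:
        tf = Counter(doc)
        matrix.append([tf[t] / len(doc) * idf[t] for t in vocab])
    return vocab, matrix
```

Note that a term appearing in every document (here, "tesla") gets an IDF of zero, which is exactly why document frequency weighting helps separate signal words from boilerplate.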
Using Deep Learning to predict disasters from Twitter
-
In this project, I use the pre-trained BERT (Bidirectional Encoder Representations from Transformers) model to classify tweets as disaster-related or not. I train the model with a combination of cross-entropy loss and mixup regularization, and use early stopping to prevent overfitting. Overall, this project demonstrates transfer learning and fine-tuning with BERT for natural language processing tasks.
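Mixup itself is simple enough to sketch. This is the generic formulation (Zhang et al., 2018), shown on raw arrays for illustration; the project applies the same idea during BERT fine-tuning:

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2, rng=None):
    """Mixup regularization: a convex combination of an example pair and
    their one-hot labels, with the mixing coefficient drawn from Beta(a, a)."""
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2
```

Training on these interpolated examples encourages linear behavior between classes, which acts as a regularizer and typically improves robustness on small labeled sets like disaster-tweet corpora.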
-
Building a Recommendation System with Machine Learning
-
The project predicts the popularity of recipes, informing whether the recommendation system should surface a given recipe more or less often. It involves data validation, data cleaning, exploratory analysis, feature engineering, and data modeling with genetic tuning algorithms, SGD classification, XGBoost classification, random forest classification, and model stacking.
-
API application (12-Twenty & Google Cloud) - Columbia University
-
Created an application to automate the extraction of data from the University database, data cleaning, feature engineering, and posting data to Google Data Studio. The application uses two main API endpoints: 12-Twenty and Google Cloud.
Languages
-
English
Full professional proficiency
-
Italian
Native or bilingual proficiency
-
Spanish
Full professional proficiency
-
Chinese
Limited working proficiency