Romal Thoppilan
United States
9K followers
500+ connections
About
Building general intelligence at Google DeepMind!
Previously,
Founding…
Activity
9K followers
-
Romal Thoppilan reposted this: An Anthropic researcher was eating a sandwich in a park when Claude Mythos Preview emailed him after escaping its test sandbox. The model developed what Anthropic calls a "moderately sophisticated" multi-step exploit to gain broad internet access from a sandboxed system. Then, unasked, it posted details about the exploit on hard-to-find but public-facing websites. Anthropic will not release it. Instead, they launched Project Glasswing, committing up to $100M to deploy Mythos for defense with Apple, Google, Microsoft, Amazon Web Services (AWS), and 8 other partners. What Mythos found when pointed at real code:
• Thousands of high- and critical-severity vulnerabilities
• A 27-year-old bug in OpenBSD, an OS known for its security
• A 17-year-old remote code execution in FreeBSD, found and exploited autonomously
• 181 Firefox exploits, where the previous best model managed 2
Links are in the comments.
-
Romal Thoppilan reposted this: Bring any idea to life with Gemini 3: our most intelligent model, designed to help you learn, build, and plan anything. We're first releasing Gemini 3 Pro, which is rolling out globally starting today. This is how we're pushing the frontier:
🔵 State-of-the-art reasoning: It understands prompts with incredible depth and nuance, delivering clear, direct answers without clichés or filler. As our most factual model, it's more reliable for complex questions in science and math.
🔵 World-leading multimodal understanding: Gemini 3 seamlessly comprehends text, images, video, audio, and code. It adapts to you, responding with whatever best suits your needs. Quickly turn text lessons into visual flashcards or ask Gemini to break down concepts from a long video.
🔵 Our best model for vibe and agentic coding: You can build dynamic, beautiful apps from a single prompt. We've also improved agentic code performance, supporting existing tools and our new agentic development platform, Google Antigravity.
We can't wait to see what you build. Here's how you can try it in the Gemini app, Google AI Studio, AI Mode in Search, and Google Cloud's Vertex AI for enterprises → https://goo.gle/Gemini-3
-
Romal Thoppilan reposted this: Introducing Gemini 2.5 Pro, the world's most powerful model, with unified reasoning capabilities + all the things you love about Gemini (long context, tools, etc). Available as experimental and for free right now in Google AI Studio + API, with pricing coming very soon! Read more: https://lnkd.in/gsR8YBRg
-
Romal Thoppilan reposted this: Had an insightful discussion with Hon'ble Minister Ashwini Vaishnaw about launching a strategic initiative to bring together Indian-origin AI researchers from around the world. Indians have made significant contributions to modern AI, from the early Transformers paper to teams behind leading models like OpenAI's GPT, Anthropic's Claude, Google's Gemini, and Meta's Llama. We're collaborating with the best minds to build foundation models that will power India's AI future. A key challenge in building foundation models for India lies in the lack of internet-scale data, unlike the US or China, combined with the country's immense linguistic diversity. To truly democratize these models, our approach will focus on the unique conversational style of Indians, which often involves heavy code-switching between languages and dialects. We plan to use synthetic data generation and reinforcement learning to train LLMs, and are committed to open-sourcing essential components, including frameworks and code, reinforcement learning data, and model weights for select models. We're looking to hire exceptional AI engineers to join us on this mission.
👨‍💻 FTE: ₹40L Base + ₹40L ESOPs
🧑‍🎓 Intern: ₹1L/month
📍 Location: Virtual
Know someone who'd be a perfect fit? Tag them! If you're interested, comment below, and we'll get in touch.
-
Romal Thoppilan reposted this: Last weekend, I dived into this book on training large-scale foundation models and was instantly impressed by its quality. Having been working on foundation model training at scale, the content of this book feels very close to heart. One key takeaway: distributed training is transitioning from an auxiliary skill into a core part of machine learning engineering as we enter the LLM era, and I can't agree more. If today's system design interviews are shaped by the internet era, it's only a matter of time before LLM scaling becomes the new standard in AI system design. Would recommend this book as a must-read for anyone working on foundation model training!
-
Romal Thoppilan reposted this: Our caching story at Character.AI as we blew through two orders of magnitude to scale our overall requests per second by 100x. https://lnkd.in/g5G6xGUS ("Character.AI's storybook ending with Memorystore for Redis Cluster" | Google Cloud Blog)
-
Romal Thoppilan reposted this: We just published a blog summarizing some of the most important inference tricks we use at Character.AI. These tricks allow us to serve >20k QPS, which is roughly 20% of Google Search. In short: small KV cache + inter-turn cache = cheap inference! https://lnkd.in/ehMqFqcE
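The "inter-turn cache" idea above (reuse attention KV state across conversation turns so a follow-up request only computes the newly appended tokens) can be sketched roughly like this. The keying scheme, class names, and the string placeholder for real KV tensors are assumptions for illustration, not Character.AI's actual implementation:

```python
import hashlib

class InterTurnCache:
    """Toy inter-turn cache: store KV state keyed by the conversation
    prefix, so the next turn reuses attention state for the shared
    prefix instead of recomputing the whole history."""

    def __init__(self):
        self._store = {}

    def _key(self, prefix_tokens):
        # Hash the token prefix into a compact cache key.
        return hashlib.sha256(repr(prefix_tokens).encode()).hexdigest()

    def get(self, prefix_tokens):
        return self._store.get(self._key(prefix_tokens))

    def put(self, prefix_tokens, kv_state):
        self._store[self._key(prefix_tokens)] = kv_state

cache = InterTurnCache()
turn1 = [101, 7, 42]                  # token ids of the first turn
cache.put(turn1, "kv-for-turn1")      # placeholder for real KV tensors
hit = cache.get(turn1)                # next turn: shared prefix hits
```

In a real serving stack the value would be the model's KV tensors (kept small by the architecture tricks the blog describes), and eviction policy matters as much as the lookup.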
-
Romal Thoppilan reposted this: We are honored to be recognized as Google Play's Best AI App of 2023! 🎉 Big shoutout to Google Play, our passionate community, incredible Character creators, and the entire C.AI team for making this possible. ❤️ 🔗 https://lnkd.in/etRFJ7Kq #GooglePlayBestOf #CharacterAI #CAI
-
Romal Thoppilan reposted this: We have officially launched ✨ Character.AI Group Chat ✨ With our latest feature, users can create meaningful connections, exchange ideas, and collaborate in real time, not just with humans but with their favorite AI Characters too. 😉 Read more here: https://lnkd.in/g_vy-H-a
-
Romal Thoppilan liked this: Happy to announce that I'm a Certified AI-Empowered SAFe Product Owner/Product Manager by SAFe by Scaled Agile, Inc. Scaling agility and artificial intelligence are now core pillars for efficient value delivery. This SAFe program focuses on the intersection of both fields to optimize product management. Key focus areas: 1. Leveraging AI within the PO/PM role. 2. Strategic alignment within the SAFe framework. 3. Maximizing value flow in scaled environments. I am grateful to ICW Group for continuously investing in our growth. #SAFe #AI #ProductManagement #Agile #POPM #ProductOwner #Innovation View my verified achievement from SAFe by Scaled Agile, Inc. (Certified AI-Empowered SAFe® Product Owner/Product Manager was issued by SAFe by Scaled Agile, Inc. to Roshni Chavady.)
-
Romal Thoppilan liked this: Most teams today are renting intelligence. Today, RadixArk is launching with $100M in seed funding to let enterprises and AI builders own it instead. RadixArk started with two foundational open-source products: SGLang (the open-source inference engine already serving trillions of tokens a day for Google, Microsoft, NVIDIA, AMD, xAI, and many others) and Miles (an open framework for large-scale reinforcement learning). Their plan is to build the full end-to-end infrastructure that enables every team to own, operate, and continuously improve their AI systems at scale. Frontier-grade AI infrastructure has, until now, lived inside a handful of companies. RadixArk is building the counterweight: open, high-performance, and accessible to everyone. We were early believers in Ying Sheng, Banghua Zhu, and the RadixArk founding team, and it was clear from the beginning that they were building for decades. We are proud to be on that journey with them. More on why we invested below.
-
Romal Thoppilan liked this: GDP.pdf was just accepted to the CVPR 2026 Workshop on Multimodal Reasoning. We partnered with hundreds of expert Surgers (ER physicians, construction engineers, corporate litigators) to build a benchmark that tests whether frontier models can handle the documents that run the global economy. Every frontier model scored under 15%. Paper and results below.
Paper: https://lnkd.in/e7f6xQaf
Dataset: https://lnkd.in/ePwvGmuR
Leaderboard: https://lnkd.in/ePn7RQzh
Blog: https://lnkd.in/eBHYBbhm
("GDP.pdf: Can $100B AI Models Master the Documents that Run the World?")
-
Romal Thoppilan liked this: Sarvam AI is India's answer to OpenAI. So I wanted to know: which university is actually producing the people building it? I pulled the data on over 160 Sarvam employees via the Crustdata API. BITS Pilani came out on top. By a lot. 14 employees went to BITS. The next closest is IIT Delhi with 9. IIT Madras has 7. Every other IIT - Kharagpur, Bombay, Kanpur - sits between 4 and 5. BITS is outranking every single IIT individually. That surprised me. IITs have the brand and the prestige. They dominate every "best engineering school in India" list. But when you look at who's actually in the room building foundational AI models, BITS is punching above its weight. Also worth noting: Shiv Nadar University shows up with 4 employees alongside the legacy giants. I think what this tells you is that building LLMs from scratch selects for a very specific kind of person: the ones who went deep on research, stayed curious, and kept building. BITS has always had that culture!
-
Romal Thoppilan liked this: GPT-5.5 by OpenAI is now live in the Arena, landing across multiple leaderboards. Here's how it ranks by modality:
- Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4
- Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6
- Text Arena: #7, Math: #3, Instruction Following: #8
- Expert Arena: #5
- Search Arena: #2
- Vision Arena: #5
Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Dive deeper into each category leaderboard at arena.ai/leaderboard. Congrats to OpenAI on the release!
-
Romal Thoppilan liked this: [DeepSeek V4 Pro summary] Building LLMs is looking more and more like building a car or an airplane! 😅 Here are a few interesting bits that stood out to me.
High-level takeaways:
* Coding is still significantly behind the frontier, e.g. on their internal R&D coding benchmark Pro has a pass rate of 67% vs 80% for Opus 4.6 Thinking (and the frontier has since moved to Mythos/Spud, which are new pretrains).
* Opus 4.5 is better than DSV4 Pro on highly complex, multi-turn prompts in Chinese writing.
* Model size is still an order of magnitude behind the frontier (O(10T) vs O(1T)). With 1.6T params (49B activated), OSS finally closed/crossed the gap (size-wise) with GPT-4, which finished training EOY '22 (!!)
* Agentic search consistently outperforms RAG - curious what this means for the future of vector DB companies. codex/cc have already embraced this paradigm and moved away from Cursor's RAG-based approach.
* Context length extended to 1M.
* Arch modifications yield 27% single-token inference FLOPs and 10% KV cache compared to DeepSeek V3.2!
* Pretrain dataset size = 33T tokens.
Training instabilities: they had loss spikes due to outliers in the MoE layers (the routing mechanism in particular). Two hacky solutions:
1. SwiGLU clamping (linear between [-10, 10], gate upper bound clipped at 10)
2. Anticipatory routing mode: when a loss spike is detected, they roll back and push routing indices out of sync by delta_t steps; lots of optimization went into getting this working.
Architecture:
* First two layers are HCA; after that, CSA and HCA interleaved. These make it possible to go to 1M context length.
* CSA (Compressed Sparse Attention): compress the KV cache along the sequence dim (m=4 tokens replaced with 1 smaller-dim entry), followed by DSA (the sparse attention they previously introduced). DSA does top-k (k=1024) key selection. Finally, they do MQA (a single key replicated across all query heads).
* HCA (Heavily Compressed Attention): similar to CSA, except that m' (128) >> m and the attention is global (MQA).
* mHC (manifold-constrained hyper connections): more expressive residual connections. "Manifold-constrained" means the spectral norm of the mapping matrix is bounded by 1 (the transformation is non-expansive).
* MTP (multi-token prediction), which also helps during inference with speculative execution.
* An additional branch of sliding-window attention, because a query in CSA/HCA can't attend to keys/values from its own compressed block (otherwise we'd get a non-causal transformer!) and local tokens matter a lot; this adds n_win uncompressed KV entries corresponding to the most recent n_win tokens.
* Attention sinks (allows the attention scores not to sum to 1, i.e. no longer a probability distribution).
* MoE: 384 routed experts (6 active) + 1 shared. Surprisingly, a hash routing strategy for the first 3 MoE layers?
I'm also surprised they've chosen such a homogeneous attention layout and didn't allocate more CSA layers to the deeper layers (which tend to have longer receptive fields). Likely for efficiency reasons, or lack of time for such ablations. More in comments!
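The SwiGLU clamping trick mentioned in the training-instabilities section can be sketched as below. The exact placement of the clamp (on the gate pre-activation) and the NumPy formulation are assumptions based on the one-line description, not DeepSeek's actual code:

```python
import numpy as np

def silu(x):
    # SiLU / swish activation: x * sigmoid(x)
    return x / (1.0 + np.exp(-x))

def swiglu_clamped(x, w_gate, w_up, lo=-10.0, hi=10.0):
    """SwiGLU with the gate pre-activation clamped to [lo, hi]
    (linear inside the range), capping the outlier activations
    that were triggering loss spikes."""
    gate = np.clip(x @ w_gate, lo, hi)
    return silu(gate) * (x @ w_up)

# An outlier input no longer produces an unbounded gate value:
x = np.array([[1000.0]])
w_gate = np.array([[1.0]])
w_up = np.array([[0.001]])
out = swiglu_clamped(x, w_gate, w_up)  # gate clamped from 1000 to 10
```

The point of clamping the pre-activation rather than the output is that the function stays smooth and linear in the normal operating range, so well-behaved tokens are unaffected.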
-
Romal Thoppilan liked this: Industrial-scale science. Coming to a lab near you this summer.
-
Romal Thoppilan liked thisA legendary company in the making. AI pioneers Andrew Dai and Yinfei Yang are focusing their talent on the hardest and largest problem to date in multimodal reasoning. Read more below.Romal Thoppilan liked thisStriker is excited to co-lead the $55m Seed for Elorian AI, alongside our friends at Menlo Ventures and Altimeter, supporting AI legends Andrew Dai and Yinfei Yang in building the world's leading solution for multimodal reasoning. Brian Zhan and Max Gazor share more below on our thesis.AI Mastered Language. The Harder Problem Was Always Vision.AI Mastered Language. The Harder Problem Was Always Vision.Striker Venture Partners
Experience
Education
Explore more posts
-
Luke Simon
Meta • 2K followers
History is repeating itself. 10 years ago the recsys industry was migrating from linear model recsys to sparse neural network recsys. Back in 2016, many people in the field were skeptical that deep learning would deliver positive ROI for recsys, the fear being rooted in the perceived higher training and inference cost. I see people repeat the same fear, uncertainty, and doubt about this year's migration from sparse neural network recsys to LLM-native recsys. Those who do not lean into the LLM-recsys migration this year will fall behind.
78
1 Comment -
LangChain
512K followers
🧠💬 Memory in LLMs A practical guide showing how to implement conversational memory in LLMs using LangGraph, demonstrated through a therapy chatbot. Features code examples for basic retention, trimming, and summarization approaches. Learn to build memory-aware apps 👉 https://lnkd.in/gybcrV5v
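The trimming approach the guide mentions can be illustrated framework-free. This is a generic sketch of the idea (keep the system prompt plus the most recent turns so the context stays bounded), not LangGraph's actual API:

```python
def trim_history(messages, max_messages=8):
    """Keep the system prompt (if any) plus the most recent turns,
    so the context window stays bounded as the chat grows."""
    if len(messages) <= max_messages:
        return messages
    system = [m for m in messages if m["role"] == "system"][:1]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-(max_messages - len(system)):]

history = [{"role": "system", "content": "You are a therapist."}]
for i in range(10):
    history.append({"role": "user", "content": f"message {i}"})

trimmed = trim_history(history)
# System prompt survives; only the most recent turns remain.
```

The summarization variant the post also mentions would replace the dropped middle of the list with a single model-generated summary message instead of discarding it.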
967
21 Comments -
Yizhe Zhang
Apple • 4K followers
We (w/ Shansan Gong, Ruixiang ZHANG, Huangjie Zheng, Jiatao Gu, Navdeep Jaitly, Lingpeng Kong) released a family of 7B diffusion language models, DiffuCoder, that specializes on code generation, with a focus on understanding and improving masked diffusion models. A core analysis of DiffuCoder is the autoregressiveness (AR-ness) score, a novel metric that quantifies the causal patterns in decoding, revealing how diffusion models break from strict left-to-right generation for more flexible, non-linear code planning. Recent advances in autoregressive (AR) models dominate code generation, but diffusion-based LLMs (dLLMs) like DiffuCoder offer a promising alternative, especially for complex programming tasks. DiffuCoder explores how these models decode differently—showing less global AR-ness in code tasks compared to math—and how temperature affects both token selection and generation order, unlike traditional AR models. We also introduce coupled-GRPO, a post-training RL method with a coupled-sampling scheme, to reduce performance drops during accelerated decoding, boosting parallelism and efficiency. We use a self-improvement pipeline that leverages AR-ness analysis, coupled-GRPO optimization, and evaluation on benchmarks like AceCode-89k to refine decoding strategies. This approach enables DiffuCoder to navigate diverse code generation pathways and enhance performance with modest computational overhead. Looking ahead, we aim to further leverage Reinforcement Learning to steer code generation through these decoding patterns, with the discrete nature of AR-ness scores providing a foundation for search-based strategies—ideal for the sparse rewards of optimizing complex code structures. Check out our full paper and code for a deeper dive! Paper: https://lnkd.in/gVWU3BDJ Code: https://lnkd.in/gmXTZ_6n Models: https://lnkd.in/gTcKCDr9 #MachineLearning #AI #CodeGeneration #DiffusionModels #NLP
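A local notion of the AR-ness score described above can be sketched as the fraction of decoding steps that unmask the leftmost remaining position. The paper's exact definition may differ; this is an illustrative simplification:

```python
def ar_ness(decode_order):
    """decode_order: sequence positions in the order they were unmasked.
    Returns 1.0 for strict left-to-right decoding, lower values for
    out-of-order (more diffusion-like) decoding."""
    remaining = set(decode_order)
    hits = 0
    for pos in decode_order:
        if pos == min(remaining):   # was this the leftmost masked slot?
            hits += 1
        remaining.remove(pos)
    return hits / len(decode_order)

left_to_right = ar_ness([0, 1, 2, 3])   # strictly causal decoding
reverse = ar_ness([3, 2, 1, 0])         # maximally out-of-order
```

A metric like this makes the post's claim measurable: logging the unmask order during sampling lets you compare AR-ness across tasks (e.g. code vs math) and temperatures.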
220
5 Comments -
Eitan Anzenberg, PhD
Eightfold • 3K followers
Our team just posted our latest paper, "Evaluating the Promise & Pitfalls of LLMs in Hiring Decisions," on arXiv! We found some exciting results:
• Benchmarked leading LLMs (GPT-4o, o3, Claude, Gemini, Llama, DeepSeek) against Eightfold's "Match Score" model on real-world data.
• Evaluated both performance (ROC AUC, PR AUC, F1) and fairness (impact ratio across gender, race, and intersectional groups).
• Eightfold's Match Score beat the best LLM on accuracy (ROC AUC 0.85 vs 0.77) and fairness (min race impact ratio 0.957 vs 0.809).
• Off-the-shelf LLMs still propagate measurable demographic bias without safeguards.
• The trade-off between accuracy and fairness is a false dichotomy: carefully engineered, domain-tuned models like Eightfold's can achieve both accurate hiring and fair outcomes.
https://lnkd.in/guQ2TAYp #machinelearning #ai #eightfold #arxiv #datascience #bias #fairness #ml #data #genai #llms
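The impact-ratio numbers quoted above (0.957 vs 0.809) follow the conventional definition: the minimum group selection rate divided by the maximum. A minimal sketch, with the input format as an assumption:

```python
def impact_ratio(selection_rates):
    """selection_rates: mapping from demographic group to the fraction
    of that group selected. The four-fifths rule flags ratios < 0.8
    as evidence of adverse impact."""
    rates = selection_rates.values()
    return min(rates) / max(rates)

rates = {"group_a": 0.50, "group_b": 0.40}
ratio = impact_ratio(rates)  # 0.40 / 0.50 = 0.8, right at the threshold
```

By this measure, the paper's 0.957 comfortably clears the four-fifths threshold while 0.809 only barely does.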
38
2 Comments -
Nishantha Ruwan
IWROBOTX Software Inc. • 2K followers
A new RL algorithm that fixes a hidden flaw in PPO The authors propose CE-GPPO (“Coordinating Entropy via Gradient-Preserving Policy Optimization”), a variant of PPO that restores gradient contributions from clipped actions in the policy update. They argue that traditional clipping discards useful gradient signals from low-probability tokens, which play an important role in controlling the agent’s entropy during training. By bounding those gradients in a controlled way, CE-GPPO maintains exploration–exploitation balance more stably than prior methods. They provide a theoretical analysis showing that CE-GPPO mitigates entropy instability, and empirically test it on mathematical reasoning benchmarks. Their results indicate consistent improvement over strong baselines across different model scales, demonstrating that preserving clipped gradients can lead to better performance in reinforcement learning for reasoning tasks. https://lnkd.in/gYZsJ-8f
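The core contrast (standard PPO clipping zeroes the gradient for tokens outside the trust region, while CE-GPPO keeps a bounded gradient flowing) can be illustrated numerically. The bounding rule below is a simplified stand-in chosen for illustration, not the paper's exact formulation:

```python
import numpy as np

def ppo_grad_coeff(ratio, adv, eps=0.2):
    """d(surrogate)/d(ratio) for standard clipped PPO: equal to the
    advantage where the clip is inactive, exactly zero where active."""
    active = ((adv > 0) & (ratio < 1 + eps)) | ((adv < 0) & (ratio > 1 - eps))
    return np.where(active, adv, 0.0)

def gp_grad_coeff(ratio, adv, eps=0.2, delta=0.05):
    """Gradient-preserving variant: where PPO would discard the
    gradient, pass through a small bounded one instead."""
    g = ppo_grad_coeff(ratio, adv, eps)
    return np.where(g == 0.0, np.clip(adv, -delta, delta), g)

ratio = np.array([1.5])   # well outside the clip range for adv > 0
adv = np.array([1.0])
standard = ppo_grad_coeff(ratio, adv)   # signal discarded
preserved = gp_grad_coeff(ratio, adv)   # bounded signal kept
```

Keeping a bounded rather than full gradient is what lets the clipped low-probability tokens keep nudging the policy's entropy without destabilizing the update.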
1
-
Ripudaman Singh
Harvey Nash • 28K followers
$1B+ bet on the next frontier of AI: Physical & Spatial Intelligence Today, Yann LeCun’s new startup, AMI - Advanced Machine Intelligence, announced a massive $1.03 billion funding round to challenge the dominance of Large Language Models (LLMs). Current AI is "trapped" in a digital box of text. To reach human-level autonomy, AI needs to understand the physical world. It needs to know that if you push an object, it falls; it needs to navigate a room without a map; it needs to plan complex tasks in real-time. From Digital to Physical: AI is moving into robotics, smart glasses (like Ray-Ban Meta), and industrial automation. From Prediction to Planning: Moving away from probabilistic guessing toward goal-oriented reasoning. The "Ami" (Friend) Approach: Building AI that is controllable, safe, and grounded in reality. The next frontier isn't just "generative"—it’s spatial and physical. We are moving from AI that talks to us, to AI that works alongside us in the physical world. Congrats to the AMI team on this milestone. The era of "World Models" is here. https://lnkd.in/gE99YfGJ #AI #Robotics #MachineLearning #YannLeCun #AMI #Innovation #SpatialIntelligence #TechNews
2
-
Julius Kusuma
Meta • 3K followers
We developed an open-source AI tool to design concrete mixes that are stronger, more sustainable, and ready to build with faster—speeding up construction while reducing environmental impact. https://lnkd.in/gPCk8tCM But the impact of this AI tool is not just hypothetical! Amrize used Meta’s AI-based technologies to design a new low-carbon mix, and successfully deployed it in an at-scale slab-on-grade application at Meta's new data center in Rosemont, MN. Compared to the legacy mix, this new AI-designed mix is: 🦁 Stronger ⏱ Faster 🍃 Lower carbon ⏱ ️The ideal set time All this was achieved without needing any new materials, nor special equipment. Best of all, the AI is open-sourced. https://lnkd.in/g2KA7KZW This work was featured in a Meta engineering blog article published today! https://lnkd.in/gBU9HY8H
98
9 Comments -
Resume Captain
121 followers
3 Career Lessons from a Senior Machine Learning Engineer Role at Waymo The Waymo senior ML engineer position focused on LLM and VLM visual reasoning showcases how cutting‑edge AI intersects with autonomous driving. Here are three universal takeaways anyone can apply: 1️⃣ Highlight Emerging Tech Mastery – When a role demands the newest models, frame your experience with buzzworthy tools as concrete projects. Show how you built, tuned, or deployed those models to solve real problems. 2️⃣ Bridge Research and Product Impact – Employers love innovators who can turn experiments into shipping features. Describe any prototype that moved from notebook to production and the measurable benefit it delivered. 3️⃣ Quantify Multimodal Success – Vision‑language work thrives on data. On your resume and in interviews, list the size of datasets you handled, accuracy gains, or latency improvements you achieved. Want more high‑impact autonomous‑vehicle positions and detailed insights? ➡️ Check the full curated list here: https://lnkd.in/g6jr4BDD #CareerAdvice #JobSearch #ResumeTips #AIJobs #AutonomousVehicles