“Tomas is one of the fastest high-quality developers I've ever met. His ability to insert himself into any piece of code, no matter how large or complex, is unmatched.”
Tomas Talius
Sammamish, Washington, United States
7K followers
500+ connections
About
Head of Engineering for Google BigQuery.
Activity
-
🚀 The Nordics' biggest Data & AI event kicks off tomorrow in Stockholm! I am thrilled to share that I will be speaking at the Data Innovation…
🚀 The Nordics' biggest Data & AI event kicks off tomorrow in Stockholm! I am thrilled to share that I will be speaking at the Data Innovation…
Liked by Tomas Talius
-
What if you could classify millions of customer reviews, support tickets, or product images without leaving your SQL editor? Interesting…
What if you could classify millions of customer reviews, support tickets, or product images without leaving your SQL editor? Interesting…
Liked by Tomas Talius
-
Looking to learn about Conversational Analytics in BigQuery? Check out this deep dive article by Aryan Irani that show how you can build…
Looking to learn about Conversational Analytics in BigQuery? Check out this deep dive article by Aryan Irani that show how you can build…
Liked by Tomas Talius
Experience
Publications
-
Hyperspace: The Indexing Subsystem of Azure Synapse
Hyperspace: The Indexing Subsystem of Azure Synapse
Microsoft recently introduced Azure Synapse Analytics, which offers an integrated experience across data ingestion, storage, and
querying in Apache Spark and T-SQL over data in the lake, including files and warehouse tables. In this paper, we present our
experiences with designing and implementing Hyperspace, the indexing subsystem underlying Synapse. Hyperspace enables users
to build multiple types of secondary indexes on their data, maintain them through a multi-user concurrency…Microsoft recently introduced Azure Synapse Analytics, which offers an integrated experience across data ingestion, storage, and
querying in Apache Spark and T-SQL over data in the lake, including files and warehouse tables. In this paper, we present our
experiences with designing and implementing Hyperspace, the indexing subsystem underlying Synapse. Hyperspace enables users
to build multiple types of secondary indexes on their data, maintain them through a multi-user concurrency model, and leverage
them automatically—without any change to their application code—
for query/workload acceleration. Many requirements of Hyperspace are based on feedback from several enterprise customers. We
present the details of Hyperspace’s underlying design, the userfacing APIs, its concurrency control protocol for index access, its
index-aware query processing techniques, and its maintenance
mechanisms for handling index updates. Evaluations over standard
industry benchmarks and real customer workloads show that Hyperspace can accelerate query execution by up to 10x and in certain
real-world workloads, even up to two orders of magnitude.Other authorsSee publication -
Transaction Log Based Application Error Recovery and Point In-Time Query.
VLDB
· Database backups have traditionally been used as the primary mechanism to recover from hardware and user errors. High availability solutions maintain redundant copies of data that can be used to recover from most failures except user or application errors. Database backups are neither space nor time efficient for recovering from user errors which typically occur in the recent past and affect a small portion of the database. Moreover periodic full backups impact user workload and increase…
· Database backups have traditionally been used as the primary mechanism to recover from hardware and user errors. High availability solutions maintain redundant copies of data that can be used to recover from most failures except user or application errors. Database backups are neither space nor time efficient for recovering from user errors which typically occur in the recent past and affect a small portion of the database. Moreover periodic full backups impact user workload and increase storage costs. In this paper we present a scheme that can be used for both user and application error recovery starting from the current state and rewinding the database back in time using the transaction log. While we provide a consistent view of the entire database as of a point in time in the past, the actual prior versions are produced only for data that is accessed. We make the as of data accessible to arbitrary point in time queries by integrating with the database snapshot feature in Microsoft SQL Server.
Other authorsSee publication -
Adapting Microsoft SQL Server for cloud computing
International Conference on Data Engineering - ICDE
Cloud SQL Server is a relational database system designed to scale-out to cloud computing workloads. It uses Microsoft SQL Server as its core. To scale out, it uses a partitioned database on a shared-nothing system architecture. Transactions are constrained to execute on one partition, to avoid the need for two-phase commit. The database is replicated for high availability using a custom primary-copy replication scheme. It currently serves as the storage engine for Microsoft's Exchange Hosted…
Cloud SQL Server is a relational database system designed to scale-out to cloud computing workloads. It uses Microsoft SQL Server as its core. To scale out, it uses a partitioned database on a shared-nothing system architecture. Transactions are constrained to execute on one partition, to avoid the need for two-phase commit. The database is replicated for high availability using a custom primary-copy replication scheme. It currently serves as the storage engine for Microsoft's Exchange Hosted Archive and SQL Azure.
Other authorsSee publication
Patents
-
Data seeding optimization for database replication
Filed US US20140236887A1
-
Systems and methods for the utilization of metadata for synchronization optimization
Issued US US8046424B2
-
Seamless upgrades in a distributed database system
Filed US US20120239616A1
-
Increasing database availability during fault recovery
Filed US US20120124001A1
-
Reorganization of data under continuous workload
Filed US US20110225122A1
-
Logical data backup and rollback using incremental capture in a distributed database
Filed US US20110191299A1
-
Hosting multiple logical databases contained in physical database
Filed US US20110179008A1
-
Extending hierarchical synchronization scopes to non-hierarchical scenarios
Filed US US20080034012A1
-
Synchronization move support systems and methods
Filed US US20060242443A1
-
Systems for the implementation of a synchronization schemas
Filed US WO2005024626A1
Recommendations received
1 person has recommended Tomas
Join now to viewMore activity by Tomas
-
🚀 🚀 BigQuery just launched the new optimized mode for AI functions (https://lnkd.in/g68VTgQ8), bringing up to 230x cost reduction and 150x…
🚀 🚀 BigQuery just launched the new optimized mode for AI functions (https://lnkd.in/g68VTgQ8), bringing up to 230x cost reduction and 150x…
Liked by Tomas Talius
-
One of the coolest BigQuery features we announced at Next'26 - making it possible to analyze large datasets with AI via Optimized mode for BigQuery…
One of the coolest BigQuery features we announced at Next'26 - making it possible to analyze large datasets with AI via Optimized mode for BigQuery…
Shared by Tomas Talius
-
VS Code and Conversational Analytics (aka Data Q&A aka NL2SQL) - that was not on my 2026 H1 Bingo Card. This is really convenient since I find…
VS Code and Conversational Analytics (aka Data Q&A aka NL2SQL) - that was not on my 2026 H1 Bingo Card. This is really convenient since I find…
Liked by Tomas Talius
-
It was great to attend the Google Cloud Next event, engaging with customers, colleagues, and partners to discuss Agents and BigQuery. I had the…
It was great to attend the Google Cloud Next event, engaging with customers, colleagues, and partners to discuss Agents and BigQuery. I had the…
Liked by Tomas Talius
-
Reposting BigQuery's main GCP NEXT blog and highlighting key AI/ML/Search launches that my team worked on over the past year: • 𝐀𝐮𝐭𝐨𝐧𝐨𝐦𝐨𝐮𝐬…
Reposting BigQuery's main GCP NEXT blog and highlighting key AI/ML/Search launches that my team worked on over the past year: • 𝐀𝐮𝐭𝐨𝐧𝐨𝐦𝐨𝐮𝐬…
Liked by Tomas Talius
-
𝗟𝗮𝘀𝘁 𝘄𝗲𝗲𝗸 𝘄𝗲 𝘄𝗿𝗮𝗽𝗽𝗲𝗱 𝘂𝗽 𝗮 𝗳𝗮𝗻𝘁𝗮𝘀𝘁𝗶𝗰 𝗰𝗼𝗻𝘃𝗲𝗿𝘀𝗮𝘁𝗶𝗼𝗻 𝗼𝗻 𝘁𝗵𝗲𝗖𝗨𝗕𝗘 𝗮𝘁 𝗚𝗼𝗼𝗴𝗹𝗲 𝗖𝗹𝗼𝘂𝗱 𝗡𝗲𝘅𝘁…
𝗟𝗮𝘀𝘁 𝘄𝗲𝗲𝗸 𝘄𝗲 𝘄𝗿𝗮𝗽𝗽𝗲𝗱 𝘂𝗽 𝗮 𝗳𝗮𝗻𝘁𝗮𝘀𝘁𝗶𝗰 𝗰𝗼𝗻𝘃𝗲𝗿𝘀𝗮𝘁𝗶𝗼𝗻 𝗼𝗻 𝘁𝗵𝗲𝗖𝗨𝗕𝗘 𝗮𝘁 𝗚𝗼𝗼𝗴𝗹𝗲 𝗖𝗹𝗼𝘂𝗱 𝗡𝗲𝘅𝘁…
Liked by Tomas Talius
-
I did a thing at Google.... I don't post much here anymore, but I wanted to share that today I got to see a year's worth of my work finally launch…
I did a thing at Google.... I don't post much here anymore, but I wanted to share that today I got to see a year's worth of my work finally launch…
Liked by Tomas Talius
-
3 days at #GoogleCloudNext. 100s of customer conversations. One theme dominated every room: Conversational Analytics. Every enterprise wants the…
3 days at #GoogleCloudNext. 100s of customer conversations. One theme dominated every room: Conversational Analytics. Every enterprise wants the…
Liked by Tomas Talius
-
🚀 How do you call a partnership that is going beyond that ? 😝 Larry Henderson Vinay Yerramilli Rohit R. Ahmed Ayad They became Friends That…
🚀 How do you call a partnership that is going beyond that ? 😝 Larry Henderson Vinay Yerramilli Rohit R. Ahmed Ayad They became Friends That…
Liked by Tomas Talius
-
BigQuery Graph is officially in preview! 🚀🕸️ To ground AI agents more effectively, they need more than just raw data; they need the context of how…
BigQuery Graph is officially in preview! 🚀🕸️ To ground AI agents more effectively, they need more than just raw data; they need the context of how…
Liked by Tomas Talius
-
It was a proud moment to see work which we started as an idea 2 years back , discussing and architecting a solution over a dinner with Chandu Bhuman…
It was a proud moment to see work which we started as an idea 2 years back , discussing and architecting a solution over a dinner with Chandu Bhuman…
Liked by Tomas Talius
-
Wrapping up an incredibly fruitful week in Vegas at Google Cloud Next! ☁️✨ Speaking about data agents in BigQuery was a massive honor, and the real…
Wrapping up an incredibly fruitful week in Vegas at Google Cloud Next! ☁️✨ Speaking about data agents in BigQuery was a massive honor, and the real…
Liked by Tomas Talius
-
Just wrapped up my first Google Cloud Next '26! It is incredible to witness BigQuery's evolution into the engine for the Agentic Era. The customer…
Just wrapped up my first Google Cloud Next '26! It is incredible to witness BigQuery's evolution into the engine for the Agentic Era. The customer…
Liked by Tomas Talius
-
As our GM Andi Gutmans launch our Agentic Data Cloud, proud and special moment for our partnership with #Vodafone and #Google as Ignacio Garcia, CTO…
As our GM Andi Gutmans launch our Agentic Data Cloud, proud and special moment for our partnership with #Vodafone and #Google as Ignacio Garcia, CTO…
Liked by Tomas Talius
Other similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content