Trevor Paulsen
Lehi, Utah, United States
8K followers
500+ connections
About
Experienced Product Manager with a proven track record in driving revenue growth and…
Activity
8K followers
-
Trevor Paulsen shared this:
Turns out funnels are pretty easy to build... it's the layers underneath that do the real heavy lifting 😅...
✅ Identity stitching across channels/devices
✅ The data group layer pre-sorts events by timestamp
✅ Sessionization
✅ The semantic layer for metrics/dimensions
So the funnel SQL builder ended up being shockingly small 😆. It composes step booleans, picks the right GROUP BY key, and hands the work to ClickHouse's "windowFunnel" aggregate function. That one built-in function does the timestamp scanning, conversion window enforcement, AND the longest-prefix calculation in a single pass. The lesson I keep relearning: when the foundation is right, every new feature on top gets cheaper (and easier for Claude to build!). Demo of the block in action below. Full write-up (with the SQL, the gotcha that bit me, and the architecture) in the comments 🔗
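The post leans on ClickHouse's windowFunnel but doesn't show its semantics. Here is a rough Python sketch of what that aggregate computes - the longest prefix of ordered steps completed within a conversion window. This is a simplification (the real aggregate supports modes like 'strict_order' that are ignored here), not the actual funnel builder from the post.

```python
def window_funnel(events, steps, window):
    """Longest prefix of `steps` matched in order, opening a conversion
    window at each occurrence of the first step. `events` is a list of
    (timestamp, event_type) pairs; timestamps are plain numbers here."""
    events = sorted(events)              # order by timestamp
    best = 0
    for i, (start_ts, ev) in enumerate(events):
        if ev != steps[0]:
            continue                     # a window only opens on step 1
        level = 1
        for ts, e in events[i + 1:]:
            if ts - start_ts > window:
                break                    # conversion window expired
            if level < len(steps) and e == steps[level]:
                level += 1
        best = max(best, level)
    return best
```

In ClickHouse the equivalent work happens inside a single `windowFunnel(window)(timestamp, cond1, cond2, ...)` call per person, which is why the SQL builder on top can stay so small.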
-
Trevor Paulsen shared this:
One of the hardest things about building a journey analytics tool that includes an AI assistant is making sure the "chat experience" and the "actually exploring your data" experience don't feel like two different products bolted together 😬 I've been wrestling with this for weeks, and I think I finally landed on something I'm excited about. The idea: every piece of analysis in Trevorwithdata is a "block" - a table, a funnel, a chart. Blocks live inside notebooks when a user is exploring, and the exact same block can get rendered inside an Ask Trevor chat message. Same component. Same query pipeline. No parallel code paths. 🤖 The unexpected win: because blocks take a structured config instead of raw SQL, the AI never generates SQL at all. It just emits a tiny JSON config and hands it to the same engine the UI uses. Way faster, way fewer tokens. ⚡ Short demo below showing the moment Ask Trevor drops a live interactive table into a chat. Full write-up in the comments 📝 Would love to hear how you're thinking about the handoff between humans and AI in your #analytics tools 🤔
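To make the "structured config instead of raw SQL" idea concrete, here is a minimal sketch of the pattern. The config shape, field names, and `events` table are hypothetical - the real Trevorwithdata configs surely differ - but the point holds: the assistant emits a small config, and one shared builder produces SQL for both the notebook UI and the chat surface.

```python
def build_block_sql(block):
    """Turn a structured block config into SQL. Both the UI and the AI
    assistant would hand configs like this to the same builder, so the
    AI never writes SQL directly. Hypothetical schema ('events' table)."""
    assert block["type"] in {"table", "funnel", "chart"}
    dims = block.get("dimensions", [])
    metrics = [f"count() AS {m}" for m in block.get("metrics", [])]
    sql = "SELECT " + ", ".join(dims + metrics) + " FROM events"
    if dims:
        sql += " GROUP BY " + ", ".join(dims)
    return sql
```

Because the model only emits the config dict, a malformed response fails validation at the `assert` instead of producing arbitrary SQL - one reason this approach is cheaper and safer than text-to-SQL.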
-
Trevor Paulsen shared this:
Latest addition to Trevorwithdata? An amazing AI assistant with a very corny name: Ask Trevor 😆 If you've been following this #journeyanalytics build, you know there are a LOT of things that need to be configured before anyone can start analyzing data - warehouses, datasets, identity stitching, metrics, dimensions, segments. And every step has enough nuance that getting it wrong means your data looks weird downstream 😬 So I built an AI assistant on top of the whole thing. It uses Anthropic's Claude Agent SDK (same framework that powers Claude Code) with 13 custom MCP tools that map to every API in Trevorwithdata. The idea is simple - anything a user can do through the UI, the assistant can do too. Here's what it actually does in practice: you describe your data and your KPIs in plain language, and the assistant handles connecting your warehouse, creating datasets, setting up identity stitching, defining metrics and dimensions - the whole pipeline. It's not just automating clicks. It's making real decisions, like recognizing that "revenue per visitor" needs two standard metrics created before it can build the calculated one, or defaulting to person-scoped segments for cross-device journeys. The secret sauce is honestly a 1,800-line system prompt where I crammed everything I know about journey analytics 😅. Decision rules, workflow patterns, scoping logic - all the domain knowledge that would normally take weeks to learn. The vision goes way beyond just setup (analysis, reporting, exploration are next), but I wanted to show where it was so far. Check out my latest post to see how I built it (and some of the system prompts I created) - link in comments 🔗
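One concrete behavior mentioned above - recognizing that "revenue per visitor" needs two standard metrics created before the calculated one - is essentially dependency ordering. A toy sketch of that planning step (the function, metric names, and dependency table are all made up for illustration; the real assistant encodes this in its system prompt and tools, not in code like this):

```python
def plan_metric_creation(target, existing, deps):
    """Return the creation order for `target`: any missing base metrics
    first, then the calculated metric itself. `deps` maps a calculated
    metric to the standard metrics it is built from."""
    missing = [m for m in deps.get(target, []) if m not in existing]
    return missing + [target]
```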
-
Trevor Paulsen shared this:
Rounding out the last bit of componentry needed for analysis this week with calculated metrics! 🎉 Now that we've got metrics, dimensions, and segments in place for our DIY #journeyanalytics platform, calculated metrics (conversion rate, average order value, revenue per visitor, etc.) were the last piece we needed before we shift focus to actual analysis and AI capabilities. It's table-stakes #analytics functionality, but a couple things in the implementation turned out interesting:
📐 Formula-level segments - you can embed segment filters directly inside a formula. So conversion rate isn't just "orders / sessions," it's {sessions:sessions_with_purchase} / {sessions}. This allows you to filter the numerator to only converting sessions while keeping the denominator unfiltered, and even compare mobile vs desktop conversion rates side by side in the same report.
🤖 Auto-segment creation - halfway through building the calc metric flow, I realized how annoying it is to leave the formula builder to create a segment you need. So now if the AI assistant sees you need a segment you don't have yet, it can create it without losing your place. Small thing, but sometimes it's the little things in life... 😆
📊 Live preview - the formula builder runs your metric against real data as you build it, so you can immediately tell if you got it right.
Full walkthrough with demo video in the blog post - link in comments 🔗
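The {metric} / {metric:segment} formula syntax above can be sketched with a tiny evaluator. This is an illustration only - how the real engine resolves each reference against the warehouse is not shown, so `values` here just maps each (metric, segment) pair to a precomputed number:

```python
import re

def eval_calc_metric(formula, values):
    """Evaluate a calculated-metric formula using the {metric} and
    {metric:segment} syntax from the post. `values` maps
    (metric, segment_or_None) to an already-computed number."""
    def resolve(match):
        metric, _, segment = match.group(1).partition(":")
        return repr(values[(metric, segment or None)])
    # after substitution only numeric literals and operators remain
    return eval(re.sub(r"\{([^}]+)\}", resolve, formula))
```

This is also why formula-level segments compose so cleanly: the segment scoping happens when each reference is resolved, so the numerator and denominator can be filtered independently.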
-
Trevor Paulsen shared this:
Next up, segments! This was a fun excuse to add a few segmentation capabilities I've always wanted to see 😄 Fair warning - video is a bit longer, but so much to share! In most #analytics tools, a "segment" is just a SQL WHERE clause - each row either matches or it doesn't. In #journeyanalytics, you need things like "people who browsed the website and then called the call center in the same session" - and that requires joining cross-channel events, matching them to the same person, and verifying the order. Tough to do with a WHERE filter. So #ClaudeCode and I built a segment engine that handles three things traditional filters can't:
1️⃣ Scope control - the same condition evaluated at event, session, or person level gives you completely different answers (and the engine handles the windowing automatically)
2️⃣ Sequential patterns - a THEN keyword that verifies events happened in order, with optional time constraints like WITHIN 30m or WITHIN SESSION
3️⃣ An AI builder that inspects your actual data, generates the expression, validates it against ClickHouse, and even tells you if you already have a segment that does what you're asking for 🤖
The expressions end up being pretty readable too. Instead of nested window functions and CTEs, you write something like:
{event_type} = 'page_view' THEN WITHIN SESSION IS_NOT_EMPTY({call_reason})
Full walkthrough with examples in the blog post - link in comments 🔗
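The example expression above - a page_view THEN, within the same session, a non-empty call_reason - can be sketched in plain Python to show what the engine has to verify. This is a simplification of the real evaluator (which also handles time constraints like WITHIN 30m and compiles to ClickHouse SQL rather than iterating rows):

```python
def then_within_session(events, first, second):
    """True if any session contains an event matching `first` followed
    strictly later by an event matching `second`. `events` is a list of
    (session_id, timestamp, fields_dict); `first`/`second` are predicates."""
    first_seen = {}                      # session_id -> earliest first-match ts
    for sid, ts, fields in sorted(events, key=lambda e: e[1]):
        if second(fields) and sid in first_seen and ts > first_seen[sid]:
            return True
        if first(fields):
            first_seen.setdefault(sid, ts)
    return False
```

Note the ordering check: a WHERE clause can tell you both events exist, but only a sequential scan like this (or its window-function equivalent) can tell you they happened in the right order.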
-
Trevor Paulsen shared this:
Next step in the DIY journey analytics build: metrics and dimensions 🎉 "Revenue by Marketing Channel" is one of those reports that sounds trivial, but if you've ever tried to build it on top of raw event data, you know there's a lot hiding under the surface. The UTM parameters only show up on the first page view, the revenue is on a purchase event way later in the session, and sometimes that purchase event fires twice because the customer refreshed the checkout page. Getting a correct answer means solving attribution, persistence, and deduplication before you even get to the actual analysis. This is the problem that a semantic layer solves in journey analytics, and it's a different kind of problem than what BI semantic layers like LookML or MetricFlow are built for. In journey data, the dimension value and the metric value often live on completely different events, and the relationship between them is sequential. So for our build, I wanted users to only have to write the simple part (a CASE WHEN expression for channel logic, for example) and let the system handle the hard sequential stuff through configuration:
1️⃣ Dimension persistence carries values forward to downstream events
2️⃣ Metric deduplication catches duplicate events automatically
3️⃣ An AI assistant can help generate the SQL expressions from plain English
Full walkthrough with examples in the blog post - link in comments 🔗
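The dimension-persistence behavior described above - carrying the UTM channel from the first page view forward so the purchase event later in the session inherits it - is easy to show in miniature. A minimal sketch, assuming events arrive per-session in time order (the real system does this with configuration on top of SQL, not per-row Python):

```python
def persist_dimension(session_events, dim):
    """Carry the last seen non-null value of `dim` forward onto later
    events, so a purchase event inherits the channel that only appeared
    on the session's first page view."""
    carried, out = None, []
    for ev in session_events:
        if ev.get(dim) is not None:
            carried = ev[dim]
        out.append({**ev, dim: carried})
    return out
```

With persistence applied, "Revenue by Marketing Channel" becomes a plain group-by again, because the channel value now lives on the same row as the revenue.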
-
Trevor Paulsen shared this:
One of the things I was most proud of when we built CJA at Adobe was "report time processing" - recomputing everything at query time instead of pre-computing ahead of time. Why? Journey analytics data doesn't sit still. Identity stitching rewrites who did what. Late-arriving data reshuffles session boundaries. Records get corrected retroactively. Anything you pre-compute is potentially wrong the moment it's written. So for this DIY build, I took the same approach - a 6-layer nested SQL query that recomputes it all at query time 😅 That means sessionization, array expansion, metric computation, segment evaluation, deduplication, and final aggregation - all happening in a single nested SQL query, every time someone runs a report. Six layers deep. The part that makes it actually work? ClickHouse's physical sort order. Our tables are sorted by person_id → date → event_id on disk, which means window functions for sessionization read data sequentially instead of reshuffling. That's the difference between "seconds" and "minutes" for this kind of layered query. Fair warning, this is a nerdy one 🤓. Full blog post with SQL examples in comments 🔗
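Of the six layers, sessionization is the easiest to show in miniature. A gap-based sketch of what the query-time window function computes (the 30-minute inactivity timeout is an assumed default; the post doesn't specify the rule):

```python
def sessionize(timestamps, gap_seconds=1800):
    """Assign a session index to each timestamp: a new session starts
    whenever the gap since the previous event exceeds `gap_seconds`.
    In the real build this is a window function running over data
    already sorted by person_id, date, event_id on disk."""
    session, last, out = 0, None, []
    for ts in sorted(timestamps):
        if last is not None and ts - last > gap_seconds:
            session += 1
        out.append(session)
        last = ts
    return out
```

Because the table's physical sort order already matches this scan order, ClickHouse can evaluate the equivalent window function as a sequential read per person instead of a shuffle - which is the "seconds vs minutes" difference the post describes.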
-
Trevor Paulsen shared this:
Of all the features we worked on during my time at Adobe, identity stitching was the one that generated the most questions. Here's the scenario: one person browses your site on a laptop (cookie_abc), logs into your app on their phone (device_xyz), buys something in-store (CRM-4821), then calls support (555-0142). Four interactions, four IDs, one person. Without stitching, your analytics sees four separate "people" and every journey report is wrong. I spent the weekend building identity stitching for our DIY journey analytics using ClickHouse, and the approach boils down to two things:
1️⃣ Stitch new events in real time using a JOIN against an identity map inside ClickHouse materialized views
2️⃣ Retroactively replay historical events when identity mappings change - using an atomic soft-delete + re-insert pattern so queries never see partial state
The part that took the most thought was replay. When you learn that cookie_abc is actually trevor@email.com, you need to go back and re-stitch every historical event for that cookie. The solution was a single atomic INSERT that both deletes the old versions and re-inserts updated ones, and ClickHouse's ReplacingMergeTree handles the cleanup automatically. The blog post covers all of this with SQL examples, plus how we handle GDPR/CCPA deletions using the same pattern. Link in comments 🔗 If you want to kick the tires yourself, I just added a waitlist signup on the blog - I'm letting a few people try it for free 🙌 #ClaudeCode #Anthropic #JourneyAnalytics #ClickHouse #DataAnalytics #AIcoding
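The soft-delete + re-insert replay pattern works because ReplacingMergeTree keeps only the highest-version row per key. A Python sketch of the read-side semantics (column names like `version` and `is_deleted` are assumptions for illustration, not necessarily the real schema):

```python
def collapse_versions(rows):
    """Mimic a ReplacingMergeTree collapse: keep the highest-version row
    per event_id, then drop rows whose surviving version is a soft
    delete. Replay = one atomic insert containing both the tombstone for
    the old row and the re-stitched copy, at higher versions."""
    latest = {}
    for row in rows:
        eid = row["event_id"]
        if eid not in latest or row["version"] > latest[eid]["version"]:
            latest[eid] = row
    return [r for r in latest.values() if not r.get("is_deleted", False)]
```

The same collapse handles GDPR/CCPA deletions: insert a tombstone at a higher version and the person's events disappear from query results, with the merge cleaning up the old rows later.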
-
Trevor Paulsen shared this:
I spent the weekend on ClickHouse ORDER BY clauses and materialized views 😑. Not glamorous, but maybe the most important design decision in this whole DIY journey analytics build. Here's the problem: journey analytics needs data from everywhere - web, mobile, CRM, transactions. But each source has different schemas, different field names, different ways of identifying users. You can't just UNION them and call it a day. So I built what I'm calling a "Data Group" layer. It takes separate source tables and combines them into one unified, queryable events table. The system:
✅ Auto-infers and merges schemas across sources
✅ Uses ClickHouse materialized views to route data in real-time
✅ Handles cookieless traffic, junk values, and missing fields
✅ Backfills historical data automatically
The part I nerded out on the most: sorting data on disk by person_id, then date, then event_id. In ClickHouse, ORDER BY controls physical layout - not just query results. So every sessionization query, every funnel, every attribution model reads data sequentially instead of reshuffling it. It's the kind of invisible decision that makes everything downstream 10x faster. Will it hold up at scale? I think so, but only time will tell... 😅 Full deep-dive with SQL examples in the blog - link in comments 🔗
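The schema auto-inference step above boils down to unioning fields across sources and resolving type conflicts. A toy sketch of that merge (widening conflicting types to String is an assumed policy for illustration, not necessarily what the real Data Group layer does):

```python
def merge_schemas(source_schemas):
    """Union field -> ClickHouse-type maps across source tables; on a
    type conflict, fall back to String so every source can still be
    routed into the one unified events table."""
    merged = {}
    for schema in source_schemas:
        for field, ftype in schema.items():
            if field in merged and merged[field] != ftype:
                merged[field] = "String"   # widen on conflict
            else:
                merged[field] = ftype
    return merged
```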
-
Trevor Paulsen liked this:
Somehow it's been a week since Summit ended. The time always goes by so fast, and so much happens at Summit that it takes a while to fully digest. This year was no different: between the keynote announcements, roadmap items, and tips and tricks, Adobe never stops evolving. But what stood out most to me wasn't the sessions; it was the connections I made. For four years now I've been lucky to attend Summit as one of the Adobe Analytics Champions. This program has helped me form not just connections, but also friendships, with practitioners from around the world. Summit is a bit like a reunion, getting to see people that I talk to online all the time, but only see in person once or twice a year. The Champions program is also currently open for applications for next year. If you use Analytics, AEM, AEP, Marketo, Workfront, or Commerce you can apply to be a part of this group. Check out the link below if you think you have what it takes to be a Champion. https://lnkd.in/eCDKPeZB #Adobe #AdobeSummit #AdobeSummit2026 #AdobeAnalytics #AdobeChampions
-
Trevor Paulsen liked this:
As I depart Las Vegas after another fantastic, successful, and exhilarating #AdobeSummit, I continue to be impressed by every single Sneak technology that was highlighted and every technologist that presented them 👏 As I mentioned on stage - come play with the Sneaks you saw on our interactive Sneaks experience: adobe.ly/sneaks Congrats to Tongyu Zhou 👑 Doga Dogan 🏆 Paolo Mottadelli Vishesh Goswami Yuzhe You Shivangi Aggarwal Jordan Hall — thank you all for your hard work and making this an incredible session!
-
Trevor Paulsen reacted to this:
Highlights for me from #AdobeSummit Day 2! The #Analytics community is seriously the best and made it so fun in our Too Hot For Main Stage session, where we got great feedback about where to take our #AdobeAnalytics products. For anybody who missed it, you'll be able to catch the recordings once those are posted (hopefully by Friday!). Jacob Draper and Stephanie Burton also had an incredible session about #CustomerJourneyAnalytics tips and tricks that was a ton of fun. #AdobeSneaks with the amazing Eric Matisoff never disappoints, and Iliza Shlesinger as co-host made it a lot of fun. Wrapped up the day with #AdobeBash, and #BNL put on a great show! #AdobeLife
-
Trevor Paulsen reacted to this:
At Adobe Summit one of the best sessions every year is Too Hot for the Main Stage. This year Derek Tangren and Brandon George presented using a battle-of-the-bands theme. They demo'd a bunch of features that may or may not make it into the product. But the purpose of this session is to get feedback from the audience! One of the great things about Adobe is the way that the product teams actually listen to the feedback provided. Absolutely amazing session, and even though the live feedback is closed, still definitely worth watching once it's posted. #Adobe #AdobeSummit2026 #AdobeSummit #AdobeAnalytics #AdobeCustomerJourneyAnalytics
-
Trevor Paulsen reacted to this:
Decades ago I hit a jogger in my car on the way to an interview. A dilemma: Not be late for the interview Or stop to help the jogger? I decided getting a job was more important. With career success I could help hundreds of joggers. Guess who was interviewing me when I showed up? It was the jogger. He told me what happened and asked "what would you do if you were driving the car?" I said "screw the interview, I'd help the jogger." He hired me on the spot. Always tell the interviewer what they want to hear.
-
Trevor Paulsen liked this:
Claude Code: turning "zoning out in a meeting" into "accidentally building a prototype." It is the most productive distraction of all time.
Experience
Education
Recommendations received
1 person has recommended Trevor