AI Hallucinations Aren’t a Bug - They’re a Byproduct of How We Build AI
There’s a common misconception that modern AI systems “know” facts. They don’t.
Large Language Models (LLMs) learn statistical patterns from data, not truth. When that data includes noise, bias, or misinformation (as much of the internet does), the model can produce outputs that are:
Fluent
Confident
…and sometimes wrong
This is what we call hallucination.
Why Hallucinations Happen
At a technical level, LLMs are optimized to predict the most likely next token, not the most verified fact.
That leads to:
Plausible but incorrect answers presented as fact
Fabricated details (e.g., citations, facts, or events that don’t exist)
Overgeneralization from biased or incomplete data
Even with fine-tuning and human feedback, this behavior cannot be fully eliminated - only controlled and mitigated.
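A toy sketch of what "most likely next token" means in practice (the prompt, candidate tokens, and probabilities are made up for illustration):

```python
# Toy illustration (not a real LLM): the model only ranks continuations
# by learned probability; nothing checks whether the sampled token is true.
import random

prompt = "The first Moon landing happened in "
next_token_probs = {
    "1969": 0.46,  # correct continuation
    "1972": 0.31,  # plausible but wrong
    "1975": 0.23,  # plausible but wrong
}

tokens, weights = zip(*next_token_probs.items())
choice = random.choices(tokens, weights=weights, k=1)[0]
print(prompt + choice)  # ~54% chance of a fluent, confident, wrong answer
```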
Reframing the Problem
Instead of asking:
“How do we eliminate hallucinations?”
A better question is:
“How do we design AI systems that know when they might be wrong and verify before answering?”
Practical Solutions (That Actually Work)
1) Retrieval-Augmented Generation (RAG)
Ground responses in trusted, real-time data sources instead of relying only on model memory.
Connect LLMs to curated knowledge bases
Force citation-backed responses
Reduce fabrication risk significantly
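A minimal RAG sketch, assuming hypothetical retrieve() and call_llm() helpers standing in for your vector store and model client:

```python
# Minimal RAG sketch: ground the answer in retrieved documents, not model memory.
def answer_with_rag(question: str, retrieve, call_llm, top_k: int = 3) -> str:
    # 1. Pull trusted context from a curated knowledge base (e.g. vector search).
    docs = retrieve(question, top_k=top_k)
    context = "\n\n".join(f"[{i + 1}] {d['text']}" for i, d in enumerate(docs))

    # 2. Force citation-backed answers and allow an explicit "I don't know".
    prompt = (
        "Answer using ONLY the sources below. Cite them as [1], [2], ...\n"
        "If the sources do not contain the answer, say 'I don't know.'\n\n"
        f"Sources:\n{context}\n\nQuestion: {question}"
    )
    return call_llm(prompt)
```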
2) Confidence & Uncertainty Modeling
Make AI express how sure it is.
Probability scoring
Calibration techniques
Threshold-based response filtering
If confidence is low → defer, ask clarifying questions, or escalate.
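A rough sketch of threshold-based filtering, assuming the model client exposes per-token log-probabilities (the 0.75 threshold is illustrative, not a recommendation):

```python
# Sketch: turn token log-probabilities into a crude confidence score,
# then answer, defer, or escalate based on a threshold.
import math

def respond_or_defer(answer: str, token_logprobs: list[float], threshold: float = 0.75) -> str:
    # Geometric-mean token probability as a simple confidence proxy.
    confidence = math.exp(sum(token_logprobs) / len(token_logprobs))
    if confidence >= threshold:
        return answer
    # Low confidence: defer, ask a clarifying question, or escalate to a human.
    return "I'm not confident enough to answer. Could you clarify or rephrase?"
```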
3) Multi-Model Verification (Ensemble Systems)
Don’t trust a single model.
Cross-check outputs across multiple models
Use consensus or voting mechanisms
Flag disagreements for review
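A small consensus-voting sketch; the models dict maps hypothetical model names to callables that each answer the question independently:

```python
# Sketch of consensus voting across models, with disagreement flagged for review.
from collections import Counter

def ensemble_answer(question: str, models: dict) -> str:
    # Ask each model independently.
    answers = {name: ask(question) for name, ask in models.items()}

    # Majority vote; anything short of a majority goes to human review.
    counts = Counter(answers.values())
    top_answer, votes = counts.most_common(1)[0]
    if votes >= (len(models) // 2) + 1:
        return top_answer
    return f"FLAGGED FOR REVIEW: models disagree ({answers})"
```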
4) Tool-Using Agents
Let AI verify before responding.
Call search APIs
Query databases
Execute code for validation
Shift from “generate answer” to “generate + validate answer”
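A sketch of that shift, with call_llm and the tools dict as placeholders for your model client and tool wrappers (search API, database query, code runner):

```python
# Sketch of "generate + validate": draft an answer, verify it with a tool,
# and only return it once the evidence supports it.
def generate_and_validate(question: str, call_llm, tools: dict) -> str:
    draft = call_llm(f"Answer concisely: {question}")

    # Ask the model which tool could verify its own draft.
    plan = call_llm(
        f"Question: {question}\nDraft answer: {draft}\n"
        f"Pick ONE tool from {list(tools)} to verify the draft, or reply 'none'."
    ).strip()

    if plan in tools:
        evidence = tools[plan](question)  # e.g. search API, SQL query, code run
        verdict = call_llm(
            f"Draft: {draft}\nEvidence: {evidence}\n"
            "Does the evidence support the draft? Answer 'yes' or 'no'."
        )
        if verdict.strip().lower().startswith("no"):
            return call_llm(f"Rewrite the answer using only this evidence: {evidence}")
    return draft
```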
5) Human-in-the-Loop (Critical Systems)
For high-stakes domains (healthcare, finance, national security):
AI proposes → human verifies → system learns
Continuous feedback loop improves reliability
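One possible shape for that loop, with risk_score and review_queue as hypothetical stand-ins for your own risk model and review tooling:

```python
# Sketch of a human-in-the-loop gate: high-risk outputs wait for expert sign-off,
# and the approved/rejected pairs become feedback data for the system.
def propose_with_review(question: str, call_llm, risk_score, review_queue) -> str:
    proposal = call_llm(question)
    if risk_score(question, proposal) > 0.5:    # illustrative threshold
        review_queue.put((question, proposal))  # human verifies before release
        return "Pending expert review."
    return proposal
```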
6) Domain-Specific Fine-Tuning
General-purpose models struggle with specialized, domain-specific facts.
Train on high-quality, domain-curated datasets
Apply strict data governance
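A sketch of a data-governance gate in front of fine-tuning; the field names and approved-source list are illustrative:

```python
# Sketch: only examples from approved sources, signed off by a domain expert,
# make it into the fine-tuning set.
APPROVED_SOURCES = {"internal_wiki", "peer_reviewed", "regulatory_filings"}

def curate_training_set(records: list[dict]) -> list[dict]:
    curated = []
    for r in records:
        if r.get("source") not in APPROVED_SOURCES:
            continue  # strict provenance requirement
        if not r.get("reviewed_by"):
            continue  # every example reviewed by a domain expert
        curated.append({"prompt": r["question"], "completion": r["answer"]})
    return curated
```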
The Strategic Insight
Hallucination is not just a technical issue - it’s a system design problem.
The future of reliable AI will not rely on:
bigger models alone
…but on:
well-architected systems that combine models, tools, data, and verification layers
Bottom Line
AI should not be treated as a source of truth, but as a:
reasoning engine that must be grounded, validated, and monitored
Organizations that understand this early will build:
More trustworthy systems
Safer AI applications
Real competitive advantage