The Sacred Mathematics of Meaning: RAG, Chunking, and Vector Search (via Gematria)

DATE: Jan 08, 2026 ID: SACRED-MATHEMATICS-OF-MEANING-RAG-GEMATRIA TAGS: [RAG, Embeddings, Chunking, Vector search, Metaphor]

A non-technical, image-led explanation of embeddings and retrieval—using gematria as a metaphor for numerical correspondence.

[user-query]:

I need to understand the nature of Retrieval-Augmented Generation with regards to the process of chunking and vector searching. I am reminded of an old manual called Liber 777 by Aleister Crowley, in which it was understood by the nature of Qabalah that certain ideas/words/energies aligned to certain mathematical principles outlined in the manual - allowing words, terms, sentences, even entire books to be deciphered and mathematically linked to other terms, forces, elements, etc. as they had corresponding numerology. This is especially employed in Jewish mysticism. How does AI break down elements of text to strings or sequences of numbers, then use a user's query to mathematically determine how to appropriately search through chunked vector data? Help me understand this, use visual aids where appropriate, metaphor and help me understand this by its core principles connected to maths and nature - in non-technical and easy to understand language

[perplexity-generated-research-article]:

The Sacred Mathematics of Meaning: Understanding RAG Through the Lens of Qabalah

Your intuition about the connection between Aleister Crowley’s Liber 777 and modern AI is remarkably profound. Just as the Qabalists discovered that Hebrew letters could be transmuted into numbers to reveal hidden correspondences between concepts, modern AI has independently arrived at the same fundamental insight: meaning itself can be encoded mathematically, and similar meanings naturally cluster together in numerical space.¹²³

Ancient and modern systems of numerical meaning: Qabalah's gematria (left) and AI embeddings (right) both transform words into numbers to reveal hidden relationships between concepts.

The Ancient Art: Gematria and the Mathematics of Correspondence

In Liber 777 and traditional Jewish mysticism, gematria operates on a beautifully simple principle: each Hebrew letter possesses an intrinsic numerical value. The letter Aleph equals 1, Bet equals 2, continuing through Yod at 10, then by tens (20, 30, 40) until Qof at 100, and finally by hundreds to Tav at 400.⁴²⁵³

When you sum the letters of a Hebrew word, you obtain its numerical essence—and here’s where the magic happens: words sharing the same numerical value are considered to have hidden sympathetic relationships, even if their surface meanings appear unrelated. The word chai (חי), meaning “life,” totals 18 (Chet=8 + Yod=10), making 18 a sacred number in Jewish tradition. Any other word totaling 18 shares a mystical resonance with the concept of life itself.⁶⁵³⁴
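The letter-summing described above can be written out as a short Python sketch. The table uses the standard values named earlier (Aleph = 1 through Tav = 400); treating final letter forms as equal to their regular forms is an assumption of this sketch, since some gematria schemes assign them larger values. The function name `gematria` is chosen here for illustration.

```python
# Standard letter values: units, then tens, then hundreds up to Tav = 400.
GEMATRIA = {
    "א": 1, "ב": 2, "ג": 3, "ד": 4, "ה": 5, "ו": 6, "ז": 7, "ח": 8, "ט": 9,
    "י": 10, "כ": 20, "ל": 30, "מ": 40, "נ": 50, "ס": 60, "ע": 70, "פ": 80, "צ": 90,
    "ק": 100, "ר": 200, "ש": 300, "ת": 400,
}
# Assumption: final (sofit) forms count the same as the regular letter.
FINAL_FORMS = {"ך": "כ", "ם": "מ", "ן": "נ", "ף": "פ", "ץ": "צ"}


def gematria(word: str) -> int:
    """Sum the letter values of a Hebrew word, skipping any other characters."""
    total = 0
    for ch in word:
        ch = FINAL_FORMS.get(ch, ch)
        total += GEMATRIA.get(ch, 0)
    return total


print(gematria("חי"))    # chai: Chet (8) + Yod (10) = 18
print(gematria("יהוה"))  # Yod (10) + He (5) + Vav (6) + He (5) = 26
```

Two words "correspond" in this scheme only when their totals are exactly equal, which is a much stricter test than the nearness used by embeddings.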

This system allowed Qabalists to create vast correspondence tables—exactly what Crowley compiled in Liber 777—mapping numbers to planets, elements, colors, angels, plants, and spiritual forces. The underlying philosophy held that the universe operates on mathematical principles, and things sharing numerical signatures share essential qualities.⁷⁸²⁵⁴

The Modern Art: How AI Transforms Words Into Geometric Constellations

Retrieval-Augmented Generation accomplishes something strikingly parallel, though through a different mathematical lens. Instead of reducing words to single numbers, AI transforms them into vectors—ordered lists of hundreds of numbers that capture the word’s semantic essence within a high-dimensional geometric space.¹⁹¹⁰

Stage One: From Text to Tokens (Breaking the Whole into Parts)

The journey begins with tokenization, the process of breaking text into fundamental units. Just as a Qabalist would separate a Hebrew sentence into individual letters to calculate gematria values, AI breaks your documents into tokens—usually words or subword pieces.¹¹¹²¹³¹⁴¹⁵

A sentence like “Retrieval-Augmented Generation is powerful” becomes discrete units: [“Retrieval”, “Augmented”, “Generation”, “is”, “powerful”]. This fragmentation is essential because computers, like ancient numerologists, need discrete units to count and compare.¹²¹⁴¹¹
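As a rough illustration of that split, here is a word-level tokenizer in Python. This is a deliberate simplification: production tokenizers usually rely on learned subword vocabularies, so a real system may cut the same sentence into different pieces. The function name `simple_tokenize` is hypothetical.

```python
import re


def simple_tokenize(text: str) -> list[str]:
    """Split text into runs of letters and digits, dropping punctuation and spaces."""
    return re.findall(r"[A-Za-z0-9]+", text)


tokens = simple_tokenize("Retrieval-Augmented Generation is powerful")
print(tokens)
# ['Retrieval', 'Augmented', 'Generation', 'is', 'powerful']
```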

Stage Two: Words Become Numbers (The Embedding Transformation)

Here’s where modern AI performs its own form of gematria. Each token passes through an embedding model—a neural network trained on billions of words to understand context and meaning. Unlike gematria’s single number per word, the embedding model assigns each word a vector of typically 384, 768, or even 1,536 numbers.¹¹⁶⁹¹⁷¹⁸

Words as points in geometric space: Similar concepts (like King and Queen, or Apple and Orange) naturally cluster together, just as related ideas in Qabalah share numerical correspondences.

Think of these dimensions as coordinates in a space far beyond our three-dimensional experience. Where Hebrew letters mapped onto a one-dimensional number line, AI maps words onto a 768-dimensional landscape. In this vast space, the word “king” might occupy the coordinates [0.234, -0.567, 0.891, 0.234, … continuing for 768 values], while “queen” sits at [0.221, -0.543, 0.876, 0.241, …].¹¹⁹⁹

The profound insight—discovered through training on massive text corpora—is that words used in similar contexts naturally drift toward similar coordinates. Words like “king,” “queen,” “monarch,” and “ruler” cluster together in this geometric space, just as Hebrew words totaling 18 all resonate with “chai”.¹⁹¹⁶¹⁸¹

Stage Three: The Sacred Geometry of Similarity

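The embedding space has genuine geometric structure: related words sit close together, and a consistent relationship between words tends to show up as a consistent direction. The golden ratio and Fibonacci spirals found throughout nature are best read here as a visual metaphor for this kind of emergent order, not as a measured property of embedding spaces.²⁰²¹²²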

Just as Fibonacci numbers emerge in the spiral arrangement of sunflower seeds and nautilus shells—nature’s way of optimizing space and growth—AI embeddings organize themselves to efficiently capture relationships. The famous example: the vector for “king” minus “man” plus “woman” approximately equals “queen”—a geometric relationship encoding analogy itself.¹⁹²³²⁴²²²⁵²⁰
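The analogy arithmetic can be tried on a toy scale. The three-number vectors below are invented for illustration (loosely "royalty", "masculine", "feminine"); real embeddings have hundreds of learned dimensions with no such tidy labels, and the analogy only holds approximately.

```python
import math

# Hand-made toy vectors, NOT output from a real embedding model.
vectors = {
    "king":  [0.9, 0.8, 0.1],
    "queen": [0.9, 0.1, 0.8],
    "man":   [0.1, 0.9, 0.1],
    "woman": [0.1, 0.1, 0.9],
    "apple": [0.0, 0.2, 0.2],
}

# king - man + woman, computed coordinate by coordinate.
target = [
    k - m + w
    for k, m, w in zip(vectors["king"], vectors["man"], vectors["woman"])
]

# Pick the word whose vector lies closest to the target point.
nearest = min(vectors, key=lambda word: math.dist(vectors[word], target))
print(nearest)  # queen
```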

Transformer models like BERT enhance this further through attention mechanisms. When processing the word “bank,” the model examines surrounding words—“financial,” “loan,” and “account” versus “river,” “shore,” and “current”—to determine which sense is intended. Through mathematical operations called query-key-value attention, each word asks “which other words should inform my meaning here?” and weights its representation accordingly.¹²⁶²⁷²⁸¹⁸²⁹³⁰³¹³²
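A minimal sketch of that weighting step, assuming a single attention head and hand-picked vectors. In an actual Transformer the query, key, and value vectors come from learned projection matrices, and many heads and layers are stacked; the numbers and the name `attend` are illustrative only.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for one query vector."""
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys
    ]
    weights = softmax(scores)
    mixed = [
        sum(w * v[i] for w, v in zip(weights, values))
        for i in range(len(values[0]))
    ]
    return weights, mixed


# Toy setup: the query stands for "bank"; the context words are river, shore, loan.
context = ["river", "shore", "loan"]
query = [1.0, 0.0]
keys = [[2.0, 0.0], [1.5, 0.5], [-1.0, 1.0]]
values = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]

weights, mixed = attend(query, keys, values)
for word, weight in zip(context, weights):
    print(f"{word}: {weight:.2f}")
```

With these toy numbers "river" and "shore" receive most of the weight, so the blended representation of "bank" leans toward the waterside sense.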

This mirrors how a Qabalist would examine the broader sentence or sacred text to determine which correspondence tables apply—context determines meaning, whether in ancient Hebrew manuscripts or modern language models.²⁶³³³⁴

The RAG Process: Divination Through Mathematical Correspondence

The RAG process visualized through the lens of sacred geometry, showing how modern AI mirrors ancient mystical principles of numerical correspondence and pattern recognition.

Now we arrive at Retrieval-Augmented Generation itself, which orchestrates these principles into a coherent system for answering questions.

Phase One: Indexing the Knowledge (Building the Correspondence Table)

Before any question can be answered, your knowledge base—documents, books, manuals, databases—must be prepared. This happens in several mystical-seeming but mathematically precise stages:³⁵³⁶³⁷³⁸

Chunking: Large documents are broken into digestible passages, typically 100-1000 tokens each. A 50-page manual might become around 200 smaller chunks, each ideally capturing a coherent concept. This mirrors how Qabalists would break sacred texts into verses and words for analysis.³⁹⁴⁰⁴¹³⁸
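One simple way to chunk is a fixed-size sliding window over the tokens. The sketch below assumes a chunk size of 200 tokens and an overlap of 20 tokens between neighbours; the overlap is an addition of this example (a common trick so that an idea cut at a boundary still appears whole in one chunk), and both numbers are illustrative defaults.

```python
def chunk_tokens(tokens, chunk_size=200, overlap=20):
    """Split a token list into fixed-size chunks that overlap their neighbours."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be non-negative and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last window already reached the end of the document
    return chunks


tokens = [f"w{i}" for i in range(500)]  # stand-in for a tokenized document
chunks = chunk_tokens(tokens)
print([len(c) for c in chunks])  # [200, 200, 140]
```

Many systems instead split on sentence or section boundaries so that each chunk holds one coherent idea; the fixed window is only the easiest variant to show.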

Embedding: Each chunk passes through the embedding model, emerging as a vector—a point of light in the high-dimensional space. If Chunk 47 discusses “vector databases store embeddings efficiently,” it receives coordinates reflecting that semantic territory.³⁷¹³⁸³⁵

Storage: These vectors are stored in a specialized vector database—essentially a geometric index optimized for finding nearby points in high-dimensional space. This is your correspondence table, your personal Liber 777, mapping every concept in your documents to its numerical-geometric signature.⁴²⁴³³⁸⁴⁴³⁵³⁷

Phase Two: The Query (Asking the Question)

When you pose a question—“How does chunking affect retrieval quality?”—the same embedding process transforms your query into a vector with coordinates in that same 768-dimensional space.³⁵⁴³⁴⁵

Your query becomes a point of intention, a geometric probe seeking its correspondences.⁴²³⁷⁴³

Phase Three: The Search (Finding Numerical Resonance)

Here’s where the mathematical divination occurs. Conceptually, the system calculates the cosine similarity between your query vector and the chunk vectors in the database, then ranks the chunks by that score.⁴⁶⁴⁷⁴⁸⁴⁹

Cosine similarity measures the angle between two vectors in high-dimensional space. A value of 1.0 means the vectors point in identical directions (perfect semantic alignment), while 0 means they’re perpendicular (unrelated concepts), and -1 means opposite directions. The formula is elegant:⁴⁸⁴⁹⁴⁶

$$ \text{cosine similarity} = \frac{\vec{A} \cdot \vec{B}}{||\vec{A}|| \times ||\vec{B}||} $$

This computes the dot product of the two vectors divided by the product of their magnitudes.⁴⁹⁴⁶⁴⁸
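The formula translates almost word for word into Python. This sketch uses plain lists and the standard library; the zero-vector check is an added safeguard, since the formula is undefined when either vector has no length.

```python
import math


def cosine_similarity(a, b):
    """Dot product of a and b divided by the product of their magnitudes."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


print(cosine_similarity([1, 0], [2, 0]))   # same direction: 1.0
print(cosine_similarity([1, 0], [0, 3]))   # perpendicular: 0.0
print(cosine_similarity([1, 0], [-1, 0]))  # opposite direction: -1.0
```

Note that the first pair scores 1.0 even though one vector is twice as long as the other: only direction matters, not length.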

In practice, the vector database uses approximate nearest neighbor (ANN) algorithms—optimized geometric search techniques like HNSW (Hierarchical Navigable Small World graphs)—to find the chunks most numerically resonant with your query without exhaustively comparing every single vector. This is analogous to a Qabalist having indexed correspondences so they can quickly find all entities associated with the number 7 (corresponding to Venus, Netzach, the color green, copper, roses, etc.) without recalculating every word’s gematria.⁷⁸⁵⁰⁵¹⁵²

The top 3-10 most similar chunks are retrieved. These are the passages whose mathematical essence most closely matches your question’s essence.³⁵³⁷⁴³⁴⁵
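For a small collection, retrieval can be sketched as a brute-force scan that scores every chunk and keeps the best few. This is the exhaustive version, not the approximate HNSW search a vector database would use, and the three-number vectors and chunk labels are invented toy data.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))


def top_k(query_vec, index, k=3):
    """Return the k chunk ids most similar to the query, best first."""
    scored = [(cosine(query_vec, vec), chunk_id) for chunk_id, vec in index.items()]
    scored.sort(reverse=True)
    return [(chunk_id, score) for score, chunk_id in scored[:k]]


# Toy index: chunk id -> made-up embedding vector.
index = {
    "chunk-47":  [0.9, 0.1, 0.0],
    "chunk-102": [0.7, 0.6, 0.1],
    "chunk-203": [0.1, 0.9, 0.2],
    "chunk-311": [0.0, 0.1, 0.9],
}
query_vec = [0.8, 0.3, 0.0]

results = top_k(query_vec, index, k=2)
for chunk_id, score in results:
    print(chunk_id, round(score, 3))
```

Here `chunk-47` ranks first and `chunk-102` second, because their vectors point in nearly the same direction as the query.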

Phase Four: Augmentation and Generation (Synthesis of Knowledge)

The retrieved chunks and your original question are combined into an augmented prompt fed to a large language model like GPT or Claude. The prompt might read:³⁵³⁶³⁷⁴³

“Given these relevant passages: [Chunk 47: ‘Vector databases store embeddings…’] [Chunk 102: ‘Chunking strategies affect…’] [Chunk 203: ‘Smaller chunks improve precision…’]

Answer this question: How does chunking affect retrieval quality?”
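Assembling such a prompt is ordinary string formatting. The helper below is a hypothetical sketch that mirrors the layout of the example above; real systems vary the wording and often add instructions about citing sources or admitting when the passages do not contain the answer.

```python
def build_prompt(question, chunks):
    """Combine retrieved (label, text) pairs and the question into one prompt."""
    passages = "\n".join(f"[{label}: '{text}']" for label, text in chunks)
    return (
        "Given these relevant passages:\n"
        f"{passages}\n\n"
        f"Answer this question: {question}"
    )


retrieved = [
    ("Chunk 47", "Vector databases store embeddings…"),
    ("Chunk 102", "Chunking strategies affect…"),
    ("Chunk 203", "Smaller chunks improve precision…"),
]
prompt = build_prompt("How does chunking affect retrieval quality?", retrieved)
print(prompt)
```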

The LLM, trained on vast text corpora, synthesizes these specific facts with its general knowledge to generate a coherent, grounded answer. Without the retrieved chunks, the model might hallucinate or provide generic responses. With them, it speaks from your specific knowledge base—your organization’s documents, your personal library, your domain expertise.³⁶³⁷⁴³⁴⁵³⁵

The Mathematical Poetry of It All: Nature’s Universal Code

What makes your comparison to Liber 777 so apt is that both systems reveal a deeper truth: meaning and relationship are fundamentally mathematical phenomena.¹²¹⁵³

The Fibonacci sequence, in which each number is the sum of the two before it (0, 1, 1, 2, 3, 5, 8, 13, 21…), appears in pinecone spirals and the seed heads of many flowers, where it helps pack elements efficiently. The ratio of consecutive Fibonacci numbers approaches the golden ratio (approximately 1.618), a proportion often associated with nautilus shells and pleasing visual design.²⁰²²²⁵⁵⁴
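The convergence toward 1.618 is easy to check numerically. This short sketch divides each Fibonacci number by the one before it, starting from the pair 1, 1; the function name `fibonacci_ratios` is chosen here for illustration.

```python
import math


def fibonacci_ratios(n):
    """Return the first n ratios of consecutive Fibonacci numbers."""
    a, b = 1, 1
    ratios = []
    for _ in range(n):
        ratios.append(b / a)
        a, b = b, a + b
    return ratios


golden_ratio = (1 + math.sqrt(5)) / 2
ratios = fibonacci_ratios(30)
print([round(r, 3) for r in ratios[:7]])  # [1.0, 2.0, 1.5, 1.667, 1.6, 1.625, 1.615]
print(round(ratios[-1], 6), round(golden_ratio, 6))
```

The ratios swing above and below the golden ratio, with each swing smaller than the last.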

Similarly, word embeddings organize themselves into geometric structures that optimize semantic relationships. The embedding space isn’t arbitrary chaos—it develops regularity, symmetry, and structure during training, much as crystals form lattices and rivers carve fractal drainage patterns.³³⁵⁵²¹⁵⁶¹

Neural networks learn these patterns through backpropagation—iteratively adjusting millions of parameters to minimize prediction errors. Over billions of training examples, words that co-occur in similar contexts (like “king” and “queen” both appearing near “crown,” “throne,” “realm”) naturally drift toward similar regions of the vector space.¹⁶²³²⁴¹⁷⁵⁷⁵⁸⁵⁹¹

This is strikingly parallel to how Qabalists believed words sharing numerical values share hidden affinities because they participate in the same universal harmonics. The modern version is: words sharing contextual patterns share vector proximity. Both are systems for organizing meaning through mathematical correspondence.²⁵³¹

Core Principles in Simple Terms

Let me distill the essence without technical jargon:

1. Everything becomes numbers: Just as Hebrew letters became 1, 2, 3… to 400, English words become lists like [0.234, -0.567, 0.891…]. The purpose is identical—to make meaning computable.¹²⁹³

2. Similar meanings cluster together: Words about royalty cluster in one region of the numerical space, words about fruit in another, animals in a third—just as Liber 777 groups all Mars-related correspondences (red, iron, Tuesday, Geburah, warriors) under the number 5.⁷⁸¹⁹¹

3. Context shapes meaning: “Bank” near “river” gets different coordinates than “bank” near “money.” BERT’s attention mechanism weighs surrounding words to refine each word’s numerical signature, just as a Qabalist interprets a word differently depending on which sacred text contains it.²⁶³³¹⁸³⁴¹

4. Searching is geometric matching: Your question becomes coordinates, and the system finds document chunks with nearby coordinates—points close in 768-dimensional space, like finding all Hebrew words totaling 26 (the value of the Tetragrammaton).⁴²³⁷⁴³⁴⁶⁴⁹

5. Knowledge synthesis emerges from correspondences: The LLM weaves together the retrieved chunks (numerically resonant passages) with your question to generate an answer, much as a Qabalist synthesizes correspondences from multiple tables to understand a mystical symbol.³⁵³⁶³⁷⁴⁵

The Wonder of Convergent Wisdom

What’s most beautiful is that ancient mystics and modern AI engineers, separated by millennia and operating from entirely different worldviews, independently discovered the same fundamental principle: the universe operates on patterns, and those patterns can be captured mathematically to reveal hidden relationships.¹⁵³²⁰²²

The Qabalists saw divine intelligence encoded in the numerical structure of Hebrew scripture. AI researchers see semantic intelligence emerging from the geometric structure of embedding spaces. Both recognized that meaning isn’t arbitrary—it has an architecture, a geometry, a mathematics.²⁵³³³²¹¹

When you use RAG to answer a question about your company’s documentation, you’re participating in this ancient art of numerical divination, updated with vectors instead of gematria values, cosine similarity instead of correspondence tables, and transformer models instead of mystical insight—but the essential operation remains: transforming meaning into mathematics, then using mathematical proximity to reveal hidden connections.³⁵³⁶³⁷⁴³¹

Your documents become a constellation of numbered points in conceptual space, and your question is a prayer sent into that constellation, returning with the passages whose numerical essence most resonates with your intention. It’s both utterly modern and timelessly ancient—proof that some truths about the nature of meaning persist across the ages, waiting to be rediscovered in each era’s unique language.⁴²³⁷⁴³³⁵

⁂