Strategic Brief: LovEdu

Production RAG Platform for Kuwait University

Education Technology Published 2026-06 9 min read

Engagement

EdTech AI Platform

Duration

3 months

Production RAG Platform for Kuwait University - LovEdu | Seven Labs Case Study

The Operational Challenge

Kuwait University students had no reliable way to query course-specific material uploaded by professors. Generic AI tools hallucinated curriculum facts and fabricated institutional regulations, making them actively harmful in an academic context. The platform needed bilingual Arabic/English support, strict no-fabrication enforcement, and course-level data isolation - problems that generic RAG templates do not solve.

The Solution & Architecture

We designed and deployed LovEdu: a production-grade AI education platform built on a multi-stage RAG pipeline. The system uses LlamaParse for academic document parsing, Weaviate for native hybrid BM25 and vector search, Cohere multilingual reranking for Arabic/English parity, comprehensive query detection with sequential page-order fetch, follow-up intelligence, token-budgeted context management, and an admin-editable system prompt stored in MongoDB. All services run in an isolated Docker network on Coolify with JWT + 2FA authentication and courseId-level database isolation.

Why This Matters

Education is one of the highest-stakes domains for AI hallucination. A student who acts on a fabricated KU grade appeal policy misses their deadline. A student who receives scrambled course material before an exam fails based on bad information. The engineering challenge in LovEdu was not building a chatbot - it was building a system where the architecture itself makes hallucination structurally impossible: every answer must come from retrieved material, and if the material is absent, the system says so. This is the standard that any AI deployed in a high-stakes domain should meet, and it requires deliberate engineering choices at every layer of the pipeline.

Functional Logic Flow

Multilingual RAG Architecture

System Integration Phase

Built a hybrid Weaviate search pipeline combining BM25 keyword matching with dense vector retrieval in a single query call, fused via Relative Score Fusion - giving the LLM both exact-match accuracy and semantic understanding simultaneously.

Optimization & Dynamic Allocation

Integrated Cohere rerank-multilingual-v3.0 as the final retrieval gate, producing relevance scores natively in Arabic and English without any translation step - the only reranking approach that delivers consistent bilingual quality.

Hardening & Scale Validation

Implemented token-budgeted context management and follow-up query detection so long study sessions never degrade answer quality - the most recent context always fits within the LLM window regardless of how many messages came before.

Key Business Metrics

Zero

Hallucination Incidents

Arabic + English

Languages Supported

Up to 60 chunks

Chunk Retrieval Mode

5 AI Modules

Tools Deployed

Outcome: Zero hallucination incidents on course material across the first semester. Full bilingual Arabic/English retrieval quality without translation overhead. Comprehensive queries returned professor-structured explanations in original document order. Context integrity maintained across 100+ message sessions. KU regulatory questions answered with explicit citations rather than fabricated policy.

Engineered Tech Ecosystem

Next.js 14Node.jsWeaviateMongoDBCohere RerankGPT-4oLlamaParseGoogle EmbeddingsCoolifyDockerTraefik

Seven Labs Verified Agency

Seven Labs is an AI Systems Engineering firm based in Islamabad, Pakistan. Our team holds professional certifications from IBM, Google Cloud, EC-Council, and CyberWarfare Labs, and has delivered production systems for banking, SaaS, real estate, and media clients across three continents.

Verified Credentials Meet the Team All Case Studies →

Case study narratives are drafted with AI writing assistance and reviewed by Seven Labs engineers for technical accuracy. All metrics, stack details, and architectural decisions reflect real implementation patterns. Client names are withheld where confidentiality agreements apply.

Initiate a similar system architecture audit.

Every project we take on is engineered for measurable outcomes. Let's map out your systems and construct a scalable deployment workflow.

Schedule Auditing Call Contact Form Inquiry

Step	What Happens	Technology
Upload	Professor uploads file (max 50 MB) via admin portal	Cloudinary
Parse	File sent to LlamaParse. Handles tables, multi-column layouts, equations - returns clean Markdown	LlamaParse
Chunking	Text split into 1,000-character chunks with 200-character overlap. Overlap ensures concepts spanning a page boundary are never severed	Custom recursive splitter
Embedding	Each chunk converted to a dense vector capturing semantic meaning (768 or 1,536 dimensions)	Google text-embedding-004 or OpenAI text-embedding-3-small
Dual-Write	Every chunk written to Weaviate (hybrid search) and MongoDB (sequential access, fallback, page-order re-sort)	Weaviate + MongoDB

Parameter	Standard Query	Comprehensive Query
Initial retrieval	25 chunks via hybrid search	Up to 60 chunks fetched by text `chunkIndex` from MongoDB
Deduplication	Jaccard trigram (0.82)	Skipped - sequential chunks are inherently unique
Reranking	Top 7 via Cohere	Top 20 via Cohere
Final ordering	Relevance score order	Re-sorted into original page order after reranking
LLM token budget	4,096 tokens	8,192 tokens

Input	Expanded
text `gcr`	google classroom
text `ku`	kuwait university
text `oop`	object oriented programming
text `nlp`	natural language processing
text `db`	database
text `ds`	data structure
text `algo`	algorithm

Tool	Purpose
KU Student Rights Advisor	Grade appeals, GPA rules, academic probation - cites KU regulations exactly, never fabricates policy
Citation Formatter	APA 7th, MLA, Harvard per KU thesis requirements - strict format compliance
Success Stories	KU graduate journeys from uploaded PDFs only - no invented stories
What's Trendy	KU events and career trends from uploaded documents and the Eventat platform only

System Metric	Baseline (Generic AI)	LovEdu Production System	Outcome
Hallucination Rate on KU Regulations	High - fabricated policies	Zero - citation required or explicit "I don't know"	Eliminated
Arabic Query Retrieval Quality	Degraded - no native Arabic reranking	Full quality - Cohere multilingual reranker	Parity with English
Follow-up Query Coherence	Broken - follow-ups retrieve unrelated material	Intact - last substantive query reused for retrieval	Maintained
Comprehensive Answer Structure	Random ordering	Page-order sequential, professor-structured	Coherent
Context Integrity at 100+ Messages	Degraded - naive history overflow	Maintained - token-budgeted trimming	Preserved

Production RAG Platform for Kuwait University

The Operational Challenge

The Solution & Architecture

Why This Matters

Multilingual RAG Architecture

System Integration Phase

Optimization & Dynamic Allocation

Hardening & Scale Validation

Initiate a similar system architecture audit.

Technical Deep Dive

Case Study: LovEdu - Production RAG Platform for Kuwait University

Executive Summary

Business Problem

Technical Challenges

Parsing Complex Academic Documents

Chunk Boundary Precision

Bilingual Retrieval Without Translation

Comprehensive vs. Targeted Query Disambiguation

Long-Session Context Degradation

Prompt Security in a Multi-Tenant Environment

Solution Architecture

Document Ingestion Pipeline

Technology Stack

Implementation Process

Phase 1: Document Ingestion Pipeline

Phase 2: Hybrid Search Configuration and Tuning

Phase 3: Reranking, Query Intelligence, and Comprehensive Mode

Phase 4: Context Management and Embedding Cache

Phase 5: Security Hardening, System Prompts, and Tool Pages

Security Considerations

Course-Level Data Isolation

Prompt Injection Prevention

Role-Based Access Control

No-Fabrication Policy at Architecture Level

Performance Optimizations

Embedding Cache Eliminates Redundant API Calls

Weaviate Single-Call Hybrid Search

Token-Budgeted Context Prevents Prompt Bloat

SSE Streaming Eliminates Perceived Latency

Health-Check-Gated Container Startup

Results & Outcomes

Lessons Learned

The Reranker Is the Most Important Component After Chunking

Sequential Fetch Outperforms Pure Vector Retrieval for Comprehensive Queries

Token Budgeting Must Be Character-Aware, Not Message-Count-Aware

Dual-Write Is Worth the Storage Overhead

Admin-Editable System Prompts Reduce Engineering Dependency

Frequently Asked Questions (FAQs)

1. How does the system ensure a student in one course cannot access another course's material?

2. Why was Weaviate chosen over Pinecone or Qdrant for this deployment?

3. What happens when a student asks a question the uploaded course material does not cover?

4. How does follow-up detection avoid false positives - treating a genuine new question as a follow-up?

5. What is the latency profile of a standard query end-to-end?

6. How are system prompt updates applied without downtime?

Schema & SEO Metadata

Internal Linking Anchors