Weekly Update #3
Weekly Update: Major Milestones!
Here’s what we accomplished this week:
1. Exam Generation Page and UI Refactoring
Kahu developed the Exam Generation Page, which allows students to create exams tailored to their needs. Alongside this, he refactored all other pages on the site to ensure a consistent look and feel.
- Responsive Design Fix: The site was also updated to work smoothly across all screen sizes. Watch here.
- Exam Generation Demo: Watch here.
2. Programming IDE Development
Kahu developed a browser-based programming interface (IDE) that integrates programming question generation. This serverless IDE now supports:
- Question generation through the VAST AI Qwen Coder endpoint.
- Test case execution, with clear pass/fail indicators for students.
Here’s what it looks like in action:
This achievement provides a fully functional, in-browser IDE for programming practice and evaluation.
3. Research and Development on RAG
Joel made significant progress on Retrieval-Augmented Generation (RAG):
- Set up a local development environment for RAG on structured data using Ollama and R2R-lite, allowing experimentation with different models and configurations.
- Researched modern RAG methods to understand which algorithms suit varying levels of needs.
- Determined that a hybrid approach combining an agent with reranking, HYDE, and RAG-fusion is suitable for current goals.
- Finalized the RAG implementation plan using:
- vLLM for R2R-based completions.
- Qwen model for temporary completions.
- mxbai-embed-large as the embedding model.
- bge-reranker-v2-m3 as the reranker.
4. TAG System Prototyping
Lev started exploring the TAG (Topic Annotation and Grouping) system for integration with the RAG implementation:
- Began prototyping a TAG system for future use.
- Researched current state-of-the-art TAG pipelines to inform design decisions.
- Started designing a TAG framework to align with the current and planned RAG implementations.
These updates reflect our continued commitment to building a reliable, student-focused platform. Stay tuned for more progress next week!