Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.
-
Updated
May 18, 2026 - Python
Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content is grounded in source documents with exact citations.
Easy modernBERT fine-tuning and multi-task learning
ModernBERT model optimized for Apple Neural Engine.
2nd place · Detect AI-generated text across 6 classes · ModernBERT + LDAM + per-class ensemble · Macro F1 0.95919 — MALTO Hackathon, Politecnico di Torino
Lightweight, Self-Hosted AI Guardrails Model based on ModernBERT.
ClassyText is a demo for zero-shot text classification using ModernBERT-large from Hugging Face.
End-to-end pipeline that identifies specialized research papers through automated classification, demonstrated with an LLMOps use case that includes data ingestion, model training, evaluation, and deployment.
Code for EXIST 2025 Task 1.1, 1.2 and 1.3
Challenge to distinguish whether a sentence from a news article expresses the subjective view of the author behind it or presents an objective view on the covered topic
AI-generated text detector using ModernBERT with a Streamlit UI for classifying text as human or AI-written
Trajectory Classification with the new architecture of BERT, the 'ModernBERT'
High-performance, edge-native compliance engine for the Fair Housing Act (FHA). Powered by ModernBERT, providing privacy-first local inference to detect real estate violations in real-time.
Verite! — Cross-domain deception detection with ModernBERT
Fine-tuned ModernBERT for software industry related article summaries
We introduce a binarized approach to Lexical Complexity Prediction (Binary LCP) and systematically compare two generations of encoder-only Transformer models: BERT and ModernBERT. Work completed as part of Natural Language Processing, DATASCI 266.
Data Processing Utilities And Training Code for r/changemyview Dataset
This is my attempt at writing an ai detector api by fine-tuning modernBert. The story about this project is discussed in the readme. You can find a link to the model playground below.
An example workflow for fine-tuning ModernBERT for a classification task using the IMDB dataset.
A finetuned ModernBERT model for named entity recognition (NER), trained on the CoNLL-2003 dataset to identify persons, organizations, locations, and miscellaneous entities in english text
Detect duplicate & unused Python code via AST hashing, Jaccard similarity, and semantic embeddings (ModernBERT, C2LLM, EmbeddingGemma). CLI + Python API w hybrid synthesis
Add a description, image, and links to the modernbert topic page so that developers can more easily learn about it.
To associate your repository with the modernbert topic, visit your repo's landing page and select "manage topics."