Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

Kseniaseย 
posted an update 2 days ago
view post
Post
4107
11 new types of RAG

RAG is evolving fast, keeping pace with cutting-edge AI trends. Today it becomes more agentic and smarter at navigating complex structures like hypergraphs.

Here are 11 latest RAG types:

1. InstructRAG -> InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning (2504.13032)
Combines RAG with a multi-agent framework, using a graph-based structure, an RL agent to expand task coverage, and a meta-learning agent for better generalization

2. CoRAG (Collaborative RAG) -> CoRAG: Collaborative Retrieval-Augmented Generation (2504.01883)
A collaborative framework that extends RAG to settings where clients train a shared model using a joint passage store

3. ReaRAG -> ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation (2503.21729)
It uses a Thought-Action-Observation loop to decide at each step whether to retrieve information or finalize an answer, reducing unnecessary reasoning and errors

4. MCTS-RAG -> MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search (2503.20757)
Combines RAG with Monte Carlo Tree Search (MCTS) to help small LMs handle complex, knowledge-heavy tasks

5. Typed-RAG - > Typed-RAG: Type-aware Multi-Aspect Decomposition for Non-Factoid Question Answering (2503.15879)
Improves answers on open-ended questions by identifying question types (a debate, personal experience, or comparison) and breaking it down into simpler parts

6. MADAM-RAG -> Retrieval-Augmented Generation with Conflicting Evidence (2504.13079)
A multi-agent system where models debate answers over multiple rounds and an aggregator filters noise and misinformation

7. HM-RAG -> HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation (2504.12330)
A hierarchical multi-agent RAG framework that uses 3 agents: one to split queries, one to retrieve across multiple data types (text, graphs and web), and one to merge and refine answers

8. CDF-RAG -> CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation (2504.12560)
Works with causal graphs and enables multi-hop causal reasoning, refining queries. It validates responses against causal pathways

To explore what is Causal AI, read our article: https://www.turingpost.com/p/causalai

Subscribe to the Turing Post: https://www.turingpost.com/subscribe

Read further ๐Ÿ‘‡
  • 1 reply
ยท
fantosย 
posted an update about 24 hours ago
view post
Post
1639
๐ŸŽจ BadgeCraft: Create Beautiful Badges with Ease! โœจ
Hello there! Today I'm introducing BadgeCraft, a simple app that lets you create stunning badges for your websites, GitHub READMEs, and documentation.

๐ŸŒŸ Key Features

๐Ÿ–Œ๏ธ 14 diverse color options including vibrant neon colors
๐Ÿ”ค Custom text input for label and message
๐Ÿ–ผ๏ธ Support for 2000+ logos via Simple Icons
๐Ÿ”— Clickable link integration
๐Ÿ‘๏ธ Real-time preview
๐Ÿ’ป Ready-to-use HTML code generation

๐Ÿ“ How to Use

Label - Enter the text to display on the left side of the badge (e.g., "Discord", "Version", "Status")
Message - Enter the text to display on the right side of the badge
Logo - Type the name of a logo provided by Simple Icons (e.g., "discord", "github")
Style - Choose the shape of your badge (flat, plastic, for-the-badge, etc.)
Color Settings - Select background color, label background color, and logo color
Link - Enter the URL that the badge will link to when clicked

โœ… Use Cases

Add social media links to your GitHub project README
Display version information or download links on your website
Include tech stack badges in blog posts
Show status indicators in documentation (e.g., "in development", "stable")

๐Ÿ’ก Tips

Click on any of the prepared examples to automatically fill in all settings
Copy the generated HTML code and paste directly into your website or blog
HTML works in GitHub READMEs, but if you prefer markdown, use the ![alt text](badge URL) format

๐Ÿ‘จโ€๐Ÿ’ป Tech Stack
This app was built using Gradio and leverages the shields.io API to generate badges. Its simple UI makes it accessible for everyone!

๐Ÿ”— openfree/Badge

โœจ Available under MIT License - feel free to use and modify.
  • 1 reply
ยท
seawolf2357ย 
posted an update 2 days ago
view post
Post
4264
๐Ÿ“š Papers Leaderboard - See the Latest AI Research Trends at a Glance! โœจ

Hello, AI research community! Today I'm introducing a new tool for exploring research papers. Papers Leaderboard is an open-source dashboard that makes it easy to find and filter the latest AI research papers.

Heartsync/Papers-Leaderboard

๐ŸŒŸ Key Features

Date Filtering: View only papers published within a specific timeframe (from May 5, 2023 to present)
Title Search: Quickly find papers containing your keywords of interest
Abstract Search: Explore paper content more deeply by searching for keywords within abstracts
Automatic Updates: The database is updated with the latest papers every hour

๐Ÿ’ก How to Use It?

Select a start date and end date
Enter keywords you want to find in titles or abstracts
Adjust the maximum number of search results for abstract searches
Results are displayed neatly in table format
aiqtechย 
posted an update 2 days ago
view post
Post
3226
๐ŸŒ AI Token Visualization Tool with Perfect Multilingual Support

Hello! Today I'm introducing my Token Visualization Tool with comprehensive multilingual support. This web-based application allows you to see how various Large Language Models (LLMs) tokenize text.

aiqtech/LLM-Token-Visual

โœจ Key Features

๐Ÿค– Multiple LLM Tokenizers: Support for Llama 4, Mistral, Gemma, Deepseek, QWQ, BERT, and more
๐Ÿ”„ Custom Model Support: Use any tokenizer available on HuggingFace
๐Ÿ“Š Detailed Token Statistics: Analyze total tokens, unique tokens, compression ratio, and more
๐ŸŒˆ Visual Token Representation: Each token assigned a unique color for visual distinction
๐Ÿ“‚ File Analysis Support: Upload and analyze large files

๐ŸŒ Powerful Multilingual Support
The most significant advantage of this tool is its perfect support for all languages:

๐Ÿ“ Asian languages including Korean, Chinese, and Japanese fully supported
๐Ÿ”ค RTL (right-to-left) languages like Arabic and Hebrew supported
๐Ÿˆบ Special characters and emoji tokenization visualization
๐Ÿงฉ Compare tokenization differences between languages
๐Ÿ’ฌ Mixed multilingual text processing analysis

๐Ÿš€ How It Works

Select your desired tokenizer model (predefined or HuggingFace model ID)
Input multilingual text or upload a file for analysis
Click 'Analyze Text' to see the tokenized results
Visually understand how the model breaks down various languages with color-coded tokens

๐Ÿ’ก Benefits of Multilingual Processing
Understanding multilingual text tokenization patterns helps you:

Optimize prompts that mix multiple languages
Compare token efficiency across languages (e.g., English vs. Korean vs. Chinese token usage)
Predict token usage for internationalization (i18n) applications
Optimize costs for multilingual AI services

๐Ÿ› ๏ธ Technology Stack

Backend: Flask (Python)
Frontend: HTML, CSS, JavaScript (jQuery)
Tokenizers: ๐Ÿค— Transformers library
ยท
ginipickย 
posted an update 2 days ago
view post
Post
3228
๐Ÿค– AI Academic Paper Generator: Your Research Partner ๐ŸŽ“

Hello, researchers! Today I'm introducing my AI Academic Paper Generation System. This application is built with Streamlit and provides AI agents to assist with every stage of the academic research process.

ginipick/AgentX-Papers

โœจ Key Features

๐Ÿ“š Literature Research: AI reviews and summarizes relevant research
๐Ÿ“ Paper Outline: Generates a well-structured paper outline
โœ๏ธ Draft Writing: Creates a paper draft based on your research topic
๐Ÿ”— Citation Generation: Automatically generates academic citations
๐Ÿ–‹๏ธ Editing & Polishing: Checks grammar, context, and logical flow
๐ŸŒ Multilingual Support: Interface available in English and Korean

๐Ÿš€ How to Use

Enter basic information like research topic, paper title, and deadline
AI agents generate everything from literature review to final paper
Download your completed paper or consult with the chatbot for further assistance

๐Ÿ’ก What Makes It Special
This tool integrates all stages of academic research. Going beyond simple text generation, it mimics the actual research process to produce higher quality papers.
Visualization features and social media sharing options will be added in the next update! ๐Ÿ’ช

#AIResearch #AcademicWriting #ResearchAssistant #ArtificialIntelligence
openfreeย 
posted an update 1 day ago
view post
Post
2386
๐Ÿ“Š Papers Impact: Instant AI Grading for Your Research Papers! ๐Ÿš€

๐ŸŒŸ Introduction
Hello, AI research community! ๐ŸŽ‰
Introducing Papers Impact - the revolutionary AI tool that automatically grades and predicts the potential impact of research papers! ๐Ÿง ๐Ÿ’ก

VIDraft/PapersImpact

โœจ Key Feature: Instant Paper Grading
The core functionality is brilliantly simple: Just enter an arXiv paper ID or URL, and our AI instantly analyzes and grades the paper's potential academic impact! No need to read through the entire paper yourself - our system automatically evaluates the title and abstract to generate a normalized impact score between 0 and 1.
๐ŸŽฏ How It Works

Enter Paper ID or URL: Simply paste an arXiv ID (e.g., "2504.11651") or full URL
Automatic Fetching: The system retrieves the paper's title and abstract
AI Analysis: Our advanced LLaMA-based transformer model analyzes the content
Instant Grading: Receive an impact score and corresponding letter grade in seconds!

๐Ÿ’ก Who Can Benefit?

๐Ÿ”ฌ Researchers: Pre-assess your paper before submission
๐Ÿ“š Students: Quickly gauge the quality of papers for literature reviews
๐Ÿซ Educators: Objectively evaluate student research
๐Ÿ“Š Research Managers: Prioritize which papers to read in depth
๐Ÿงฉ Journal Editors: Get an AI second opinion on submissions

๐Ÿš€ Technical Details
Our model is trained on an extensive dataset of published papers in CS.CV, CS.CL, and CS.AI fields, using NDCG optimization with Sigmoid activation and MSE loss. It's been rigorously cross-validated against historical citation data to ensure accurate impact predictions.
  • 2 replies
ยท
openfreeย 
posted an update 3 days ago
view post
Post
4116
๐Ÿง  ThinkFlow: The Revolutionary Platform That Gives LLMs the Power to Think ๐Ÿš€

Hello AI community! We're excited to introduce you to ThinkFlow, an innovative service that transforms how language models solve problems. ๐ŸŽ‰
VIDraft/ThinkFlow-llama

โœจ What is ThinkFlow?
ThinkFlow is a groundbreaking platform that automatically applies step-by-step reasoning capabilities to existing LLM models without any modifications. It makes complex problem-solving transparent, allowing you to witness the model's thought process in real-time.

๐Ÿ” Key Features

Reasoning Without Model Modifications: Add step-by-step reasoning while utilizing existing LLMs as they are โš™๏ธ
Visualized Thinking Process: See exactly how the model analyzes and solves problems ๐Ÿ‘๏ธ
Before & After Comparison: Compare standard responses with reasoning-enhanced outputs in real-time ๐Ÿ“Š
Improved Accuracy: Deliver more accurate solutions for complex math and logic problems ๐Ÿ“ˆ
Educational Value: Teach students systematic approaches to problem-solving ๐Ÿ‘จโ€๐Ÿซ
User-Friendly Interface: Intuitive and easy-to-use UI for seamless experience ๐Ÿ–ฅ๏ธ

๐Ÿ’ก What Problems Can It Solve?
ThinkFlow is particularly effective for various domains including:

Complex mathematical problems ๐Ÿงฎ
Logic puzzles ๐Ÿงฉ
Questions requiring multi-step reasoning ๐Ÿค”
Scientific analysis challenges ๐Ÿ”ฌ
Complex decision-making processes ๐Ÿ“

๐Ÿ‘จโ€๐Ÿ’ป Technical Details
ThinkFlow is built on the meta-llama/Llama-3.1-8B-Instruct model and uses carefully designed prompt chains to guide the model through step-by-step thinking. Each reasoning step builds upon the results of previous steps, culminating in a comprehensive final answer.

๐Ÿ’ฌ Join Our Community!
If you have questions or suggestions about ThinkFlow, join our Discord community: https://discord.gg/openfreeai
Let's build better AI reasoning experiences together! ๐Ÿ’ช

#AI #LLM #ReasoningAI #ThinkFlow #HuggingFace #OpenSource #AIEducation
ยท
hesamationย 
posted an update 1 day ago
view post
Post
1813
The best researchers from DeepSeek, OpenAI, Microsoft, and ByteDance explored RL and Reasoning in LLMs,

Here's some of their key findings:

1/ RL can further improve distilled models. These models are essentially SFT fine-tuned with the data generated by larger models, and the SFT+RL combo does not disappoint.

This is verified in the DeepSeek-R1 paper.

2/ both GRPO and PPO algorithms suffer from length bias; they encourage longer responses. This can be tackled by introducing explicit rewards based on the length of the answer.

3/Most reasoning research is focused on code and math. But training models on logic puzzles improves them for mathematical tasks too.

This shows the RL reasoning is generalized beyond the specific domain knowledge.

Previous research also shows RL can be a great generalizer.

4/The reasoning might not be only induced by RL; it might already be hidden in the base models due to the pre-training and CoT data they were trained on.

So while RL does wake up the reasoning beast, maybe it's not the only solution (e.g. other methods such as distillation)

5/ back to the length bias; reasoning models tend to generate longer responses for wrong answers. RL might be the culprit.

RL favours longer answers when the reward is negative, to dilute the penalty per individual token and lower the loss.

This might explain the "aha" moments!

6/ OpenAI's competitive programming paper showed an interesting finding:

o3 can learn its own test-time strategies (like writing an inefficient but correct solution to verify the answer of an optimized solution)

RL helps LLMs develop their own reasoning & verification methods.
The recent article by @rasbt helped me a lot in getting a broad view of the recent research on reasoning models.

He also lists more influential papers on this topic, It's a must-read if you're interested.

check it out ๐Ÿ‘‡
https://magazine.sebastianraschka.com/p/the-state-of-llm-reasoning-model-training
Jawardย 
posted an update 1 day ago
as-cle-bertย 
posted an update about 17 hours ago
view post
Post
1076
Finding a job that matches with our resume shouldn't be difficult, especially now that we have AI... And still, we're drowning in unclear announcements, jobs whose skill requirements might not really fit us, and tons of material๐Ÿ˜ตโ€๐Ÿ’ซ
That's why I decided to build ๐‘๐ž๐ฌ๐ฎ๐ฆ๐ž ๐Œ๐š๐ญ๐œ๐ก๐ž๐ซ (https://github.com/AstraBert/resume-matcher), a fully open-source application that scans your resume and searches the web for jobs that match with it!๐ŸŽ‰
The workflow is very simple:
๐Ÿฆ™ A LlamaExtract agent parses the resume and extracts valuable data that represent your profile
๐Ÿ—„๏ธThe structured data are passed on to a Job Matching Agent (built with LlamaIndex๐Ÿ˜‰) that uses them to build a web search query based on your resume
๐ŸŒ The web search is handled by Linkup, which finds the top matches and returns them to the Agent
๐Ÿ”Ž The agent evaluates the match between your profile and the jobs, and then returns a final answer to you

So, are you ready to find a job suitable for you?๐Ÿ’ผ You can spin up the application completely locally and with Docker, starting from the GitHub repo โžก๏ธ https://github.com/AstraBert/resume-matcher
Feel free to leave your feedback and let me know in the comments if you want an online version of Resume Matcher as well!โœจ