Blog

What’s happening at Agentset.

Stay informed with product updates, company news, and insights on how to sell smarter at your company.

Featured

Tuesday, November 25, 2025
An evaluation of Opus 4.5 inside a real retrieval setup, compared against Gemini 3 Pro and GPT 5.1 across five behaviors that matter for RAG.
Umida Muratbekova
Umida Muratbekova
Wednesday, November 19, 2025
We tested Gemini 3 inside an actual retrieval setup and compared it directly with GPT-5.1 across five areas that matter for RAG.
Umida Muratbekova
Umida Muratbekova
Sunday, November 16, 2025
We compared 13 embedding models across 8 datasets using an LLM judge and ELO scoring. The result: almost all of them perform in the same narrow band.
Umida Muratbekova
Umida Muratbekova
Tuesday, November 25, 2025
Umida Muratbekova
Umida Muratbekova

Opus 4.5 is the new best model for RAG

An evaluation of Opus 4.5 inside a real retrieval setup, compared against Gemini 3 Pro and GPT 5.1 across five behaviors that matter for RAG.

Wednesday, November 19, 2025
Umida Muratbekova
Umida Muratbekova

Gemini 3 vs GPT 5.1 for RAG

We tested Gemini 3 inside an actual retrieval setup and compared it directly with GPT-5.1 across five areas that matter for RAG.

Sunday, November 16, 2025
Umida Muratbekova
Umida Muratbekova

Embedding models have converged

We compared 13 embedding models across 8 datasets using an LLM judge and ELO scoring. The result: almost all of them perform in the same narrow band.

Friday, November 7, 2025
Umida Muratbekova
Umida Muratbekova

Best Reranker for RAG: We tested the top models

We benchmarked eight leading rerankers under identical conditions to find which one performs best for real-world RAG pipelines — comparing speed, accuracy, and LLM-judged relevance.

Monday, October 27, 2025
Umida Muratbekova
Umida Muratbekova

Cohere vs ZeRank: Which Reranker Actually Performs Better?

We compared Cohere v3.5 and ZeRank-1 in a RAG pipeline using a BEIR subset and a custom dataset — analyzing accuracy, latency, and LLM preference.

Thursday, May 1, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

Building Effective RAG Pipelines: A Practical Guide

Learn how to design and implement robust retrieval-augmented generation (RAG) pipelines, from document processing to retrieval optimization.

Tuesday, April 15, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

Is RAG Dead?

OpenAI released the GPT 4.1 models supporting 1M token context window. Gemini supports up to 10M tokens in research. Is the RAG era over?

Tuesday, March 25, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

Automate Business Workflows with AI Agents

Discover how AI agents can transform business operations by automating complex workflows, reducing manual effort, and improving efficiency.

Monday, March 10, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

Building a Proof-of-Concept RAG System in an Afternoon

A practical guide to quickly building a functional retrieval-augmented generation system to demonstrate the value of AI-powered document search.

Tuesday, February 25, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

The Art of Document Chunking for LLM Applications

Explore the nuances of effective document chunking strategies for retrieval-augmented generation systems and how they impact LLM performance.

Monday, February 10, 2025
Abdellatif Abdelfattah
Abdellatif Abdelfattah

Parsing PDF Documents at Scale

Learn strategies and techniques to efficiently extract structured information from large volumes of PDF documents for use in AI applications.