Diagram showing a safer ABAP AI development pattern with a local IDE and LLM, git repository, CI/CD, and a read-only ADT MCP server connected to the SAP system.

ABAP and Agentic AI: The Hidden Problem in Real Projects

Series note: This post is part of my AI ABAP development series, where I go from AI development in general, to ABAP-specific problems, and then to ARC-1. In the first post of this series, I wrote more generally about how I use AI today, why context matters so much, and why I am still not fully convinced by the more maximalist agentic hype. Here I want to get more concrete, because SAP development is not one thing, and that matters a lot for AI. ...

April 22, 2026 · 13 min · 2625 words · Marian Zeis
Screenshot of the AI development workflow with ChatGPT, Claude Code, and MCP servers.

How I Use AI for Development and Why Context Matters

Series note: This post is part of my AI ABAP development series, where I go from AI development in general, to ABAP-specific problems, and then to ARC-1. When it comes to AI, I do not really think about magic. I think more about another colleague helping me out. Sometimes that means architecture. Sometimes implementation. Sometimes testing, review, CI/CD, or deployment. But this only works with trust, and trust needs communication. If I do not know something, I ask. If code looks strange, I ask. If tests fail, I ask. ...

April 20, 2026 · 10 min · 2037 words · Marian Zeis
Screenshot of the sap-ai-mcp-servers README showing the overview tables and categories.

SAP MCP Servers: The Missing Overview

If you are searching for SAP MCP Servers (Model Context Protocol), you quickly run into the same problem: Which SAP MCP servers exist (official and community), and what other SAP AI tooling should you know about? That is why I like this new repo: marianfoo/sap-ai-mcp-servers: SAP MCP Servers and SAP AI Skills list. It is a curated list of SAP MCP servers, SAP AI skills, and adjacent developer tools. The README is structured as tables so you can scan by category, compare licenses, and see what is actively maintained. ...

March 5, 2026 · 2 min · 239 words · Marian Zeis
Bar chart showing cumulative ABAP understanding success rates by model and feedback round.

SAP’s ABAP-1 Loses Every ABAP Benchmark, Even “Explaining”

Previous post (code generation benchmark): Benchmarking LLMs for ABAP. Live benchmark results (old + new): abap-llm-benchmark.marianzeis.de. In my first evaluation (based on the TH Köln benchmark paper), I extended the original setup with additional models and focused on a very concrete question: how well can LLMs generate ABAP code that actually compiles and passes ABAP Unit tests? I also tested SAP’s model ABAP-1, and it performed very poorly for code generation. To be fair: SAP also states this in the documentation. ABAP-1 is primarily meant for explaining ABAP code, not for reliably generating full working implementations. ...

March 3, 2026 · 5 min · 1005 words · Marian Zeis
Screenshot showing the integration of SAP RAP actions in the AI agent workflow.

Building an Enterprise-Ready SAP AI Agent with Open Source

There is currently a huge hype around the new MCP (Model Context Protocol) servers. I’ve built one for SAP Docs, and there are excellent community projects like the ADT API MCP Server. These tools are fantastic for individual developers. You install them, connect them to your IDE or a local LLM, and suddenly your AI assistant knows about SAP. Architecture overview (in one picture): Before diving into the details, here is the high-level architecture I’m aiming for: ...

March 3, 2026 · 8 min · 1681 words · Marian Zeis
Bar chart showing cumulative ABAP code generation success rates by model and feedback round.

Benchmarking LLMs for ABAP: Why ABAP-1 Isn't a Code Generator (Yet)

Live benchmark results: abap-llm-benchmark.marianzeis.de. In a lot of SAP webcasts and webinars, especially around AI, the question comes up very early: which model are you using, and which one do you recommend? For CAP and UI5 the answer is usually pretty simple: use the current best model from Anthropic. If you add good context via MCP servers from the community or SAP, you are basically fine. There is just a lot of public knowledge available, and most of it is in JavaScript/TypeScript, which LLMs handle extremely well. ...

February 9, 2026 · 6 min · 1253 words · Marian Zeis