Skip to content

Update #46

@Donj63000

Description

@Donj63000

Hi,

I took the time to really dig into the codebase, not just test the mod at a surface level, and I think there’s currently a noticeable gap between how the project is presented and how it actually works under the hood.

I’ll be direct, but with a constructive intent: in its current state, this isn’t a set of general-purpose autonomous AI agents capable of executing arbitrary complex commands in Minecraft. The architecture is closer to:

LLM → structured response / task plan → Java executor → predefined actions

In other words, the LLM mainly acts as a high-level planner. It interprets natural language into structured tasks, but all the actual in-game execution is handled by hardcoded Java action classes. This means the system is fundamentally constrained by the set of implemented actions, their robustness, and the consistency between the parser, internal API, and executor.

Looking at the code, a few things stand out:

  • several advertised capabilities are still incomplete or stubbed at runtime;
  • the “coder / code execution” path does not appear to be part of the main execution loop;
  • there are inconsistencies between exposed parameters and what actions actually consume;
  • the pipeline works reasonably well for simple commands, but breaks down once you move beyond cases that map cleanly to existing primitives.

Because of this, statements like “execute your commands” or the general impression of broad autonomy are misleading in the current state. The system behaves more like a solid LLM-driven task planner than a fully capable in-game assistant.

This is not meant as a criticism of the work itself — the foundation is interesting, and the overall architecture makes sense. But right now, it feels more like a strong prototype than a production-ready autonomous agent system.

I think the project would benefit from a more technically precise presentation:

  • clearly stating that the LLM handles planning, while execution is limited to predefined actions;
  • distinguishing fully implemented features from experimental or partial ones;
  • avoiding claims of broad autonomy until the execution layer matches the planning layer.

On my side, since I’ve already started analyzing the internal structure, I’ll likely work on improving some parts of the code — especially around local LLM integration, action/API/executor consistency, and a few blocking issues in the execution pipeline. If I manage to clean this up properly, I’ll push commits and/or open a pull request on GitHub.

The goal here isn’t to tear the project down, but to help bring it closer to what it aims to be. There’s real potential, but it needs alignment between what’s implemented and what’s being communicated.

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions