Divide and Conquer Approach #404

Open
wants to merge 10 commits into base: develop

EneaGore
Contributor

@EneaGore EneaGore commented Jan 27, 2025

Motivation and Context

A classic divide-and-conquer approach: split the grading instructions into criteria and create a dedicated prompt for each criterion. The LLM is invoked asynchronously for each criterion, so practically no latency is added. Output token usage stays more or less the same, but input token costs are higher. The responses are then combined into one final assessment.
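
As a rough illustration of the flow described above, here is a minimal sketch using `asyncio.gather`. It is not the code from this PR: `Criterion`, `Feedback`, `build_prompt`, `call_llm`, and `assess` are hypothetical names, and `call_llm` is a stub standing in for whatever LLM client is actually used.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Criterion:
    # Hypothetical shape: a criterion title plus its grading instructions.
    title: str
    instructions: list[str]


@dataclass
class Feedback:
    criterion: str
    text: str


def build_prompt(criterion: Criterion, submission: str) -> str:
    """Build a prompt that covers exactly one criterion."""
    bullet_list = "\n".join(f"- {item}" for item in criterion.instructions)
    return (
        f"Grade only the criterion '{criterion.title}':\n{bullet_list}\n\n"
        f"Submission:\n{submission}"
    )


async def call_llm(prompt: str) -> str:
    # Stub (assumption): a real implementation would await an LLM client here.
    await asyncio.sleep(0)
    return f"(model response to a prompt of {len(prompt)} characters)"


async def assess_criterion(criterion: Criterion, submission: str) -> Feedback:
    response = await call_llm(build_prompt(criterion, submission))
    return Feedback(criterion=criterion.title, text=response)


async def assess(criteria: list[Criterion], submission: str) -> list[Feedback]:
    # All criteria are requested concurrently, so total latency is roughly
    # that of the slowest single call. gather() returns results in input order.
    tasks = (assess_criterion(c, submission) for c in criteria)
    return list(await asyncio.gather(*tasks))
```

Calling `asyncio.run(assess(criteria, submission))` returns one `Feedback` per criterion, in the same order as `criteria`. The submission text is repeated in every prompt, which is where the higher input token cost comes from.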

Some preconditions:

  1. The exercise must have structured grading criteria.
  2. The usage counts must all be the same within one criterion. They can differ between criteria.
  3. The usage counts must be well defined. If you use the default of 0, this approach treats it as applicable any number of times.
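
The preconditions above could be checked up front along these lines. This is only a sketch under assumptions: `Instruction`, `usage_count`, and `check_preconditions` are hypothetical names, not the data model used in this PR, and rejecting negative counts is one assumed reading of "well defined".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Instruction:
    # Hypothetical shape of one grading instruction inside a criterion.
    description: str
    usage_count: int = 0  # default 0 is treated as "no limit"


def check_preconditions(
    criteria: dict[str, list[Instruction]],
) -> dict[str, Optional[int]]:
    """Return the usage limit per criterion (None means unlimited).

    Raises ValueError if a precondition is violated.
    """
    if not criteria:
        raise ValueError("the exercise has no structured grading criteria")

    limits: dict[str, Optional[int]] = {}
    for title, instructions in criteria.items():
        if not instructions:
            raise ValueError(f"criterion '{title}' has no instructions")
        counts = {instruction.usage_count for instruction in instructions}
        if len(counts) != 1:
            raise ValueError(f"usage counts differ within criterion '{title}'")
        count = counts.pop()
        if count < 0:
            # Assumption: a negative count is not a well-defined limit.
            raise ValueError(f"negative usage count in criterion '{title}'")
        limits[title] = None if count == 0 else count
    return limits
```

Counts are only compared within a criterion, so two criteria may carry different limits without raising an error.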

Evaluation

Evaluation on the exercise System Design Review (SS21), with 9 credits and 3 criteria.
[Three evaluation plots: newplot (1), newplot (2), newplot (3)]

Steps for Testing

Screenshots

@EneaGore EneaGore marked this pull request as ready for review January 27, 2025 20:28