Divide and Conquer Approach #404

EneaGore · 2025-01-27T07:53:44Z

Motivation and Context

A classic approach: split the grading instructions into criteria and create a specific prompt for each criterion. The LLM is invoked asynchronously for each criterion, so the added latency is practically zero. Output token usage stays more or less the same, but input token costs are higher. The responses are then combined into one final assessment.
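The fan-out and combine step described above can be sketched roughly as follows. This is a minimal illustration and not the code in this PR: `Criterion`, `build_prompt`, `fake_llm_call`, and `assess` are hypothetical names, and the LLM request is replaced by a stub that only sleeps.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Criterion:
    # Hypothetical stand-in for one structured grading criterion.
    title: str
    instructions: str


async def fake_llm_call(prompt: str) -> str:
    # Stub for a real LLM request (assumption); sleeps to mimic network latency.
    await asyncio.sleep(0.01)
    return f"feedback for: {prompt}"


def build_prompt(criterion: Criterion, submission: str) -> str:
    # One criterion-specific prompt per request.
    return f"[{criterion.title}] {criterion.instructions} | submission: {submission}"


async def assess(criteria: list[Criterion], submission: str) -> list[str]:
    # All requests are in flight at once, so total latency is roughly that of
    # the slowest single call. gather returns results in input order.
    calls = [fake_llm_call(build_prompt(c, submission)) for c in criteria]
    return list(await asyncio.gather(*calls))


if __name__ == "__main__":
    demo = [Criterion("Structure", "Check layering"), Criterion("Clarity", "Check naming")]
    for line in asyncio.run(assess(demo, "student answer")):
        print(line)
```

Because `asyncio.gather` keeps the input order, the combined assessment can be assembled criterion by criterion without extra bookkeeping.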

Some preconditions:

The exercise must have structured grading criteria.
The usage counts must all be the same within one criterion. They can differ between criteria.
The usage counts must be well defined. If you use the default of 0, this approach treats the instruction as applicable an unlimited number of times.
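The usage-count preconditions could be checked along these lines. This is a sketch under assumptions: `Instruction`, `GradingCriterion`, and `usage_limit` are hypothetical names and do not come from the Athena code base. Only the rules themselves (equal counts within a criterion, 0 meaning unlimited) are taken from the list above.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Instruction:
    # Hypothetical model of one grading instruction.
    credits: float
    usage_count: int = 0  # default 0 is treated as "no limit"


@dataclass
class GradingCriterion:
    # Hypothetical model of one structured grading criterion.
    title: str
    instructions: list[Instruction] = field(default_factory=list)


def usage_limit(criterion: GradingCriterion) -> Optional[int]:
    """Return the usage limit shared by all instructions, or None if unlimited."""
    counts = {instruction.usage_count for instruction in criterion.instructions}
    # Exactly one distinct value is required; an empty criterion is rejected too.
    if len(counts) != 1:
        raise ValueError(
            f"criterion {criterion.title!r}: usage counts must all match, got {sorted(counts)}"
        )
    count = counts.pop()
    return None if count == 0 else count
```

Different criteria may return different limits; only the instructions inside a single criterion have to agree.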

Evaluation

Evaluated on the exercise System Design Review (SS21), which has 9 credits and 3 criteria.

Steps for Testing

Testserver States


Screenshots

…into divide-and-conque

base for divide and conquer approach

ef2a498

github-actions bot assigned EneaGore Jan 27, 2025

Enea_Gore added 6 commits January 27, 2025 17:04

inital implementation

7e5197d

some refactoring and sanitizing

3c6d46d

cohesive refactoring and prompt improvments

09fe557

linters paradise

0d314af

lint

1214dce

remove uneccessary parameters

b21c67d

EneaGore marked this pull request as ready for review January 27, 2025 20:28

EneaGore and others added 3 commits February 15, 2025 20:28

Merge branch 'develop' into divide-and-conque

444ab67

minor refactoring

a218776

Merge branch 'divide-and-conque' of https://github.com/ls1intum/Athena …

3276431

…into divide-and-conque
