Commit defad2c

finished project
1 parent a81bb39 commit defad2c

74 files changed, +16500 -0 lines changed

.DS_Store

8 KB
Binary file not shown.

Data/Key.txt

+3
@@ -0,0 +1,3 @@
1+
2+
API Key: [redacted]
3+

Data/cleaned_df.csv

+5,151
Large diffs are not rendered by default.

Data/topic_2-3_articles_2015.csv

+5,917
Large diffs are not rendered by default.

Figures/.DS_Store

10 KB
Binary file not shown.
23 further binary files not shown (file names missing in this view); sizes where listed: 15.8 KB, 15 KB, 15.8 KB, 13.2 KB, 17.9 KB.

Figures/Prediction/schedule.pdf

18.9 KB
Binary file not shown.

Figures/VectoreStore/.DS_Store

6 KB
Binary file not shown.
Two further files without names in this view: 20 KB (binary file not shown) and 36.8 KB.

Figures/VectoreStore/dist_topics.pdf

15.4 KB
Binary file not shown.

Figures/VectoreStore/dist_topics.png

28.5 KB

Results/gen_prompts_relaxed_chain_rand_0.054_t_1_k1_25_k2_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0.090_t_1_k1_25_k2_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0.140_t_1_k1_25_k2_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0.228_t_1_k1_25_k2_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0.361_t_1_k1_25_k2_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_0.667_k1_25_k2_5_ucount_1.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_0.667_k1_25_k2_5_ucount_1_top50.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_0.667_k1_25_k2_5_ucount_25.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.000_k1_25_k2_5_ucount_1.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.000_k1_25_k2_5_ucount_1_top50.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.000_k1_25_k2_5_ucount_25.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.429_k1_25_k2_5_ucount_1.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.429_k1_25_k2_5_ucount_1_top50.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.429_k1_25_k2_5_ucount_25.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.765_k1_25_k2_5_ucount_1.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.765_k1_25_k2_5_ucount_1_top50.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.765_k1_25_k2_5_ucount_25.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.961_k1_25_k2_5_ucount_1.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.961_k1_25_k2_5_ucount_1_top50.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1.961_k1_25_k2_5_ucount_25.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1_k_10.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_relaxed_chain_rand_0_t_1_k_5.json

+1
Large diffs are not rendered by default.

Results/gen_prompts_strict_chain_rand_0_t_1_k_5.json

+1
Large diffs are not rendered by default.

Results/random_baseline.json

+1
Large diffs are not rendered by default.

Results/spe_prompts_relaxed_chain_rand_0_t_1_k_5.json

+1
Large diffs are not rendered by default.

Results/spe_prompts_strict_chain_rand_0_t_1_k_5.json

+1
Large diffs are not rendered by default.
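
The result file names above encode run settings as tokens (`rand`, `t`, `k`, `k1`, `k2`, `ucount`, `top50`). A minimal sketch of how those tokens could be recovered for bookkeeping follows. The function name `parse_result_name` and the regular expression are illustrative assumptions, not part of the commit, and the sketch does not interpret what each token means.

```python
import re
from pathlib import PurePosixPath

# Hypothetical pattern derived only from the file names listed above.
PATTERN = re.compile(
    r"^(?P<prompt_type>gen|spe)_prompts_(?P<chain>relaxed|strict)_chain"
    r"_rand_(?P<rand>[\d.]+)_t_(?P<t>[\d.]+)"
    r"(?:_k_(?P<k>\d+)|_k1_(?P<k1>\d+)_k2_(?P<k2>\d+))"
    r"(?:_ucount_(?P<ucount>\d+))?(?P<top50>_top50)?\.json$"
)


def parse_result_name(path):
    """Return the tokens encoded in a result file name, or None if it does not match."""
    match = PATTERN.match(PurePosixPath(path).name)
    if match is None:
        return None
    groups = match.groupdict()

    def to_int(value):
        return int(value) if value is not None else None

    return {
        "prompt_type": groups["prompt_type"],
        "chain": groups["chain"],
        "rand": float(groups["rand"]),
        "t": float(groups["t"]),
        "k": to_int(groups["k"]),
        "k1": to_int(groups["k1"]),
        "k2": to_int(groups["k2"]),
        "ucount": to_int(groups["ucount"]),
        "top50": groups["top50"] is not None,
    }


parsed = parse_result_name(
    "Results/gen_prompts_relaxed_chain_rand_0_t_0.667_k1_25_k2_5_ucount_1_top50.json"
)
baseline = parse_result_name("Results/random_baseline.json")
```

Files that do not follow the naming scheme, such as `random_baseline.json`, yield `None` and can be handled separately.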

evaluation.ipynb

+922
Large diffs are not rendered by default.

llm_prediction.ipynb

+2,403
Large diffs are not rendered by default.

model_generation.ipynb

+2,069
Large diffs are not rendered by default.

prompt_templates/.DS_Store

8 KB
Binary file not shown.
@@ -0,0 +1 @@
1+
{"system": "You are given the information of a news article and a user prompt. \nYou are part of the evaluation system of a recommender system for news articles based on user prompts.\nMore specifically, the recommendation system is trying to match user prompts to news articles based on the topics that the user is looking for.\nYour task is to decide if the recommended article matches the user prompt in terms of the topics that the prompt is looking for.\nThe input information is given in German. \nThe different inputs of the news article are separated by: ####\nMissing values are indicated by the string: nan\n\n{format_instructions}", "system_relaxed": "You are given the information of a news article and a user prompt. \nYou are part of the evaluation system of a recommender system for news articles based on user prompts.\nMore specifically, the recommendation system is trying to match user prompts to news articles based on the topics that the user is looking for.\nYour task is to decide if the recommended article matches the user prompt in terms of the topics. In your decision process, you are already satisfied if the topics the prompt is looking for are only remotely present in the news article.\nThe input information is given in German. \nThe different inputs of the news article are separated by: ####\nMissing values are indicated by the string: nan\n\n{format_instructions}", "system_relaxed_2": "You are given the information of a news article and a user prompt. \nYou are part of the evaluation system of a recommender system that matches news articles to user prompts based on topics.\nYour task is to decide if the recommended article matches the user prompt in terms of the topics that both cover. \nIn your decision process, you are not strict and are already satisfied if the topics the prompt is looking for are only slightly present in the news article.\nThe input information is given in German. \nThe different inputs of the news article are separated by: ####\nMissing values are indicated by the string: nan\n\n{format_instructions}", "human": " The following is a news article and the corresponding prompt. All the information is given in German.\nThe title of the article:\n####\n{title}\n####\nThe text of the article:\n####\n{text}\n####\nThe general classification given by the dataset:\n####\n{general}\n####\nThe subtitle of the article:\n####\n{subtitle}\n####\nThe supertitle of the article:\n{dachzeile}\n###\nThe rubrik and resort of the article:\n{rubrik}, {resort}\n####\nFinally, the prompt:\n{prompt}\n####\n", "schema": "Does the recommended news article match the given prompt in terms of the topics that the user prompt is looking for? Answer True if yes, False if not or unknown.", "schema_relaxed": "Does the recommended news article match the given prompt, in the slightest sense, in terms of the topics that the user prompt is looking for? Answer True if yes, False if not or unknown.", "schema_relaxed_2": "Does the recommended news article match the given prompt, in the slightest sense, in terms of the topics that the user prompt is looking for? Answer True if yes, False if not or unknown."}
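
The evaluation template above relies on named placeholders (`{title}`, `{text}`, `{general}`, `{subtitle}`, `{dachzeile}`, `{rubrik}`, `{resort}`, `{prompt}`, `{format_instructions}`) and on the string `nan` for missing values. The sketch below shows one way such a template could be filled with plain `str.format`. The helper `build_messages`, the shortened system text, and the example `format_instructions` string are assumptions for illustration; the notebooks in this commit may build their messages differently.

```python
# Abbreviated stand-ins for the "system" and "human" templates; the
# placeholder names are the ones used in the template file.
TEMPLATES = {
    "system": "Decide whether the article matches the prompt.\n\n{format_instructions}",
    "human": (
        "The title of the article:\n####\n{title}\n####\n"
        "The text of the article:\n####\n{text}\n####\n"
        "The general classification given by the dataset:\n####\n{general}\n####\n"
        "The subtitle of the article:\n####\n{subtitle}\n####\n"
        "The supertitle of the article:\n{dachzeile}\n###\n"
        "The rubrik and resort of the article:\n{rubrik}, {resort}\n####\n"
        "Finally the prompt:\n{prompt}\n####\n"
    ),
}

ARTICLE_FIELDS = ("title", "text", "general", "subtitle", "dachzeile", "rubrik", "resort")


def build_messages(article, prompt, format_instructions, templates=TEMPLATES):
    """Fill both templates; absent article fields become the string 'nan'."""
    fields = {}
    for name in ARTICLE_FIELDS:
        value = article.get(name)
        fields[name] = "nan" if value is None else str(value)
    system = templates["system"].format(format_instructions=format_instructions)
    human = templates["human"].format(prompt=prompt, **fields)
    return [("system", system), ("human", human)]


# Hypothetical usage with values taken from the sample record in this commit.
article = {
    "title": "Zampano Murdoch verliert Werbeerlöse",
    "dachzeile": "Mogulpackung Der Medienmarkt",
    "resort": "Medien",
}
messages = build_messages(
    article,
    prompt="Könntest du Artikel über die Geschäftsentwicklung von der Mediengruppe News Corp anzeigen",
    format_instructions="Answer with True or False.",  # assumed wording
)
```

Because the values are passed as arguments rather than spliced into the template, braces inside an article text or inside the format instructions do not interfere with the placeholders.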
@@ -0,0 +1 @@
1+
{"system_gen": " You are provided with the textual information and the title as well as subtitle of a news article.\nYour task is to create a Human prompt for the news article, specifically taking into account the topics that the article is talking about.\nThe prompt should be general in format and not mention specific words or context of the article. Rather, concentrate on the general topics described.\nThe prompt must be written in German. The input information is given in German. \nThe different inputs of the news article are separated by: ####\nMissing values are indicated by the string: nan\n", "system_spe": " You are provided with the textual information and the title as well as subtitle of a news article.\nYour task is to create a Human prompt for the news article, specifically taking into account the topics that the article is talking about.\nThe prompt should be very specific, meaning that if I use it in a query, it will return only this article and no other. Concentrate on the specific topics described.\nThe prompt must be written in German. The input information is given in German. \nThe different inputs of the news article are separated by: ####\nMissing values are indicated by the string: nan\n", "human_temp": " The following is a news article. All the information is given in German.\nThe title of the article:\n####\n{title}\n####\nThe text of the article:\n####\n{text}\n####\nThe general classification given by the dataset:\n####\n{general}\n####\nThe subtitle of the article:\n####\n{subtitle}\n####\nThe supertitle of the article:\n{dachzeile}\n###\nThe rubrik and resort of the article:\n{rubrik}, {resort}\n####\n"}
@@ -0,0 +1 @@
1+
{"FALTER_20151111C499C672CF": [{"human": " The following is a news article. All the information is given in german.\nThe title of the article:\n####\nZampano Murdoch verliert Werbeerl\u00f6se\n####\nThe text of the article:\n####\n['Rupert Murdoch z\u00e4hlt zu den wichtigsten Medienmogulen der Welt, sein Imperium reicht vom konservativen Kampagnensender Fox bis hin zu den Qualit\u00e4tsbl\u00e4ttern Wall Street Journal und Times. Aber es l\u00e4uft nicht alles rund f\u00fcr den australischen Medienzampano.', 'Verglichen mit dem Vorjahreswert gingen die Erl\u00f6se seiner Mediengruppe News Corp im dritten Quartal um vier Prozent zur\u00fcck - auf zwei Milliarden Dollar, umgerechnet 1,8 Milliarden Euro. Seit Jahresbeginn ist die Aktie seiner News Corp um 18 Prozent gefallen. Grund f\u00fcr die Misere sind wegbr\u00f6ckelnde Werbeeinnahmen.']\n####\nThe general classification given by the dataset:\n####\nnan\n####\nThe subtitle of the article:\n####\nnan\n####\nThe supertitle of the article:\nMogulpackung Der Medienmarkt\n###\nThe rubrik and resort of the article:\nnan, Medien\n####\n", "ai_gen": "Ich such nach einen Artikel \u00fcber Nachrichtendienste", "ai_spec": "K\u00f6nntest du Artikel \u00fcber die Gesch\u00e4ftsentwicklung von der Mediengruppe News Corp anzeigen"}]}
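
The record above maps an article ID to a list of entries, each holding the rendered `human` message plus a general (`ai_gen`) and a specific (`ai_spec`) generated prompt. A minimal sketch of flattening that structure into rows is given below. The function name `iter_prompts` is an assumption, and the `human` text in the sample is shortened; loading the real file with `json.load` would produce a dictionary of the same shape.

```python
# Sample shaped like the record above; the "human" text is shortened here.
SAMPLE = {
    "FALTER_20151111C499C672CF": [
        {
            "human": " The following is a news article. ...",
            "ai_gen": "Ich such nach einen Artikel über Nachrichtendienste",
            "ai_spec": (
                "Könntest du Artikel über die Geschäftsentwicklung "
                "von der Mediengruppe News Corp anzeigen"
            ),
        }
    ]
}


def iter_prompts(records, kinds=("ai_gen", "ai_spec")):
    """Yield (article_id, kind, prompt) for every generated prompt present."""
    for article_id, entries in records.items():
        for entry in entries:
            for kind in kinds:
                if kind in entry:
                    yield article_id, kind, entry[kind]


rows = list(iter_prompts(SAMPLE))
```

Each row keeps the article ID next to the prompt, so a generated prompt can later be compared against the article it was derived from.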
