feat: Add GitHub integration with agent_prompts and github_components #1637

julian-risch · 2025-04-10T12:41:54Z

Related Issues

Related to Add GitHub integration with agent_prompts and github_components #1650
Related to Move Agent from experimental to main haystack-experimental#250
@sjrl brought up that one way to keep example files from the experimental Agent around is an integration chore: Remove Agent after Haystack 2.12 release haystack-experimental#263 (comment)

Proposed Changes:

Move github_components from experimental to a new integration
Move agent_prompts from experimental to a new integration

The idea is to enable users to run the example notebook (or a version with updated imports) after having installed this new integration

How did you test it?

New unit tests and I ran all usage examples successfully with a test repo.

I haven't tested it with the notebook yet, which we would need to update first. (tracked by deepset-ai/haystack-cookbook#183 )

Notes for the reviewer

I suggest we rename github_token parameter to api_key for consistency with many other integrations.
Some components have github_token: Optional[Secret] = None, because they can work without any token while others use Secret.from_env_var("GITHUB_TOKEN"). I suggest we use Secret.from_env_var("GITHUB_TOKEN", strict=False) where we currently have None as the default.
The internal implementation of the components differs in how they use _get_headers or _get_request_headers or define headers inline. We could refactor that.
While we could find a way to set up integration tests, I would rather leave them out of this PR.
GithubRepositoryViewer has a branch parameter in the run method, which could also be named ref to make more clear it can also be a tag or commit hash
Similar to the empty lines in prompts and linebreaks: Do we need to add whitespaces in the beginning of new lines. Trailing whitespaces are removed by the linter.

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.

integrations/github/src/haystack_integrations/prompts/github/comment_tool.py

integrations/github/src/haystack_integrations/prompts/github/context.py

integrations/github/src/haystack_integrations/prompts/github/file_editor_tool.py

sjrl · 2025-04-29T09:27:53Z

integrations/github/README.md

@@ -0,0 +1,21 @@
+# github-haystack


I think it would be good for us to add examples here in the Readme on how to use or to link to the tutorial/google colab for how to use.

Also another relevant detail I think is that these prompts were optimized using Anthropic models. Could be a useful thing for users to know.

google colab in the cookbook and some more examples in an integration page is what I imagine. The README's we currently don't fill out, for example see: https://pypi.org/project/opensearch-haystack/
Might be a good idea to change that and use a copy of the integrations page. I don't see a good reason to keep it empty but I would prefer a consistent solution. I'll talk to Bilge.

integrations/github/src/haystack_integrations/components/connectors/github/pr_creator.py

integrations/github/src/haystack_integrations/components/connectors/github/repo_viewer.py

sjrl · 2025-04-29T10:43:44Z

@julian-risch maybe a general comment on the structure here. I see that the prompts aren't being used within the library and I understand they will be used in a future tutorial/colab.

I wonder then if it would be helpful to instead pre-assemble the tools within the repo so users could easily import the tools and immediately pass them to an Agent. What do you think?

integrations/github/src/haystack_integrations/components/connectors/github/file_editor.py

sjrl · 2025-04-29T12:17:09Z

integrations/github/src/haystack_integrations/components/connectors/github/issue_viewer.py

+    def _get_request_headers(self) -> dict:
+        """
+        Get headers with resolved token for the request.
+
+        :return: Dictionary of headers including authorization if token is present
+        """


I like this pattern for an incidental reason. By not putting the resolve of the Secret in the init method we allow users to not need to provide the secret until run time which is nice especially when running pipeline validation like in the deepset platform where the secrets are not available to the pipeline in the pipeline builder.

Could be nice to do this in the other components, basically to move the resolve_value() outside of the init method.

sjrl · 2025-04-29T12:19:30Z

@julian-risch overall this looks really good! I mostly have minor comments and only one larger conceptual one about maybe providing users Tools directly instead of needing to compose them, themselves.

I didn't comb through every line since there is a lot, but it's well tested so it's good to go from my perspective! We can always make quick updates to this if things arise and depending on usage.

add agent_prompts and github_components

4aba2a9

github-actions bot added the type:documentation Improvements or additions to documentation label Apr 10, 2025

julian-risch added 2 commits April 10, 2025 15:20

rename to github_haystack

890ecde

remove github-haystack

b99ad72

This was referenced Apr 10, 2025

add agents example with GitHub components from experimental deepset-ai/haystack-cookbook#183

Draft

chore: Remove Agent after Haystack 2.12 release deepset-ai/haystack-experimental#263

Merged

julian-risch assigned julian-risch and unassigned julian-risch Apr 10, 2025

This was referenced Apr 10, 2025

Move Agent from experimental to main deepset-ai/haystack-experimental#250

Open

Add GitHub integration with agent_prompts and github_components #1650

Open

julian-risch added 10 commits April 15, 2025 08:06

Merge branch 'main' into move-github-components

97029b1

renamed integration, added components dir

af37ecd

add tests, pydoc, update pyproject.toml

ac4bf31

add workflow

2e2202a

fmt

fcfec47

fmt

79dc7db

lint

cf3a2f5

ruff

f82412f

fmt

fa9cbe3

lint:all

c4beece

github-actions bot added the topic:CI label Apr 16, 2025

julian-risch added 8 commits April 16, 2025 16:36

replace StrEnum for py 3.9+ compatibility

4a6b81b

move files

e77a49e

fix tests

4e49081

lint

7144176

fix pydoc and extend init files

b4a375d

Add integration:github to labeler.yml

2b8bc14

unify how we set GITHUB_TOKEN in tests

8480832

fix 3 usage examples. 3 remaining

ca977a1

julian-risch marked this pull request as ready for review April 25, 2025 10:28

julian-risch requested a review from a team as a code owner April 25, 2025 10:28

julian-risch requested review from sjrl and removed request for a team April 25, 2025 10:28

Merge branch 'main' into move-github-components

b735b76