Conversation
* Fix: Prioritize -cwu over -headless flag When both -cwu (Chrome WebSocket URL) and -headless flags are used together, -cwu should take precedence since the user explicitly wants to connect to an existing browser session. Without this fix, when users run: katana -headless -u URL -cwu ws://... -no-incognito The -headless flag causes Katana to launch a new Chrome instance, completely ignoring the -cwu WebSocket URL. This makes authenticated crawling via existing browser sessions impossible when following the official documentation. Changes: - Prioritize ChromeWSUrl check in engine selection (runner.go) - Add warning when -headless is used with -cwu (options.go) - Ensure hybrid engine is selected when -cwu is specified This fix maintains backward compatibility while making the documented workflow actually work as intended. Fixes issue where -cwu flag is ignored when -headless is present. * chore(runner): rm logs --------- Co-authored-by: aparsons2 <[email protected]> Co-authored-by: Dwi Siswanto <[email protected]>
- Add -kb flag to enable page/form/field classification via dit - Add -fpt flag to filter results by page type (auto-enables -kb) - Classification works across all engines (standard, headless, hybrid) - KB data included in jsonl output via knowledgebase field - Bump Go version to 1.25.x across CI, Dockerfile, and docs
) Two-layer hybrid system: - Layer 1: Heuristic regex detects UUIDs, hashes, dates, timestamps, and numeric IDs - Layer 2: Adaptive per-host trie learns parameter positions at runtime via cardinality tracking Structurally identical URLs (e.g., /users/123 and /users/456) are collapsed during crawling, preventing the crawler from wasting time on duplicate pages.
…-urls feat: add -filter-similar flag for URL path pattern deduplication
Bumps [goreleaser/goreleaser-action](https://github.com/goreleaser/goreleaser-action) from 4 to 7. - [Release notes](https://github.com/goreleaser/goreleaser-action/releases) - [Commits](goreleaser/goreleaser-action@v4...v7) --- updated-dependencies: - dependency-name: goreleaser/goreleaser-action dependency-version: '7' dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <[email protected]>
feat(headless): add captcha detection and solving support
…ons/dev/goreleaser/goreleaser-action-7 chore(deps): bump goreleaser/goreleaser-action from 4 to 7
Neo - PR Security ReviewNo security issues found Highlights
Hardening Notes
Comment |
|
@pdneo create an issue to track this release |
|
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Organization UI Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Tip Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs). Comment |
|
@dogancanbakir Created issue #1568 to track katana v1.5.0 release Check it out here: #1568
|
…-dit Add knowledge base classification using dit
Relates to #1568