Releases: tomasnorre/crawler

[RELEASE] Crawler version 9.2.2

26 Jan 20:44

Added

  • AccessService::hasGroupAccess()

Deprecated Functions & Properties

  • CrawlerController->CLI_releaseProcesses()
  • CrawlerController->hasGroupAccess()

Fixed

  • buildQueue with --mode exec resulted in a 503
  • Set content in fetchUrlContents() to false if it is null, to prevent serialization from failing

[RELEASE] Crawler version 9.2.1

27 Dec 11:48

Changed

  • Moved HTML from LogRequestForm->drawLog_addRows() to a Fluid template
  • Extended the Example for PageTS Configuration

Deprecated Functions

  • SignalSlotUtility->emitSignal()

Deprecated Classes

  • SignalSlotUtility

Fixed

  • PageTS Crawler configurations (broken since 9.1.3)

[RELEASE] Crawler version 9.2.0

22 Dec 07:00

Added

  • Add support for TYPO3 11.0

Deprecated Classes

  • CrawlerApi

[RELEASE] Crawler version 9.1.5

19 Dec 16:34

Added

  • Error handling when page 0 is used for crawler:buildQueue

Changed

  • Removed all references to cHash as it is not used anymore.

Deprecated Functions & Properties

  • CrawlerController->getLogEntriesForPageId()
  • CrawlerController->CLI_runHooks()
  • QueueRepository->countAllByProcessId()
  • QueueRepository->countUnprocessedItems()
  • FlushQueueCommand --page option

Fixed

  • Typecast the port number in cli/bootstrap so that processQueue works with non-standard ports
  • Flush entire queue works again
  • Check whether PageRow['uid'] is an integer before passing it on to the next function

[RELEASE] Crawler version 9.1.4

27 Nov 21:30

Changed

  • IndexedSearchCrawlerHook::class is marked as deprecated and will be removed when dropping support for TYPO3 9LTS and 10LTS

Deprecated Classes

  • IndexedSearchCrawlerHook

Deprecated Functions & Properties

  • CrawlerController->getDuplicateRowsIfExist()

Fixed

  • Auto-loading for non-composer projects fixed

[RELEASE] Crawler version 9.1.3

27 Nov 21:15

Added

  • QueueFilter to operate with an object instead of strings/arrays

Deprecated Functions & Properties

  • BackendModule->modMenu()
  • CrawlerController->CLI_debug()
  • CrawlerController->getAccessMode()
  • CrawlerController->setAccessMode()
  • CrawlerController->getDisabled()
  • CrawlerController->setDisabled()
  • CrawlerController->getProcessFilename()
  • CrawlerController->setProcessFilename()
  • CrawlerController->accessMode
  • CrawlerController->processFilename

Changed

  • Updated Rector, PHPStan, ECS for better static code analysis
  • BackendModule is split into smaller classes to improve readability and maintainability
  • Crawler class added; it can be used to check whether the crawler is enabled or disabled

Fixed

  • aoe/crawler/initialization (middleware) is moved before typo3/cms-core/normalizedParams so that the crawler is "last" in the middleware chain

[RELEASE] Crawler version 9.1.2

07 Nov 08:10

Added

  • Progress bar in the crawler:buildQueue command output when used with --mode exec
  • Improved documentation about proper crawler configuration for users with Website Usergroups

Fixed

  • Detailed process view is callable again
  • Make sure the QueueRepository is always set when needed in Domain/Model/Process
  • Crawling with FE-Groups is correctly initialized with both TYPO3 9 & 10

[RELEASE] Crawler version 9.1.1

17 Oct 17:34

Added

  • Documentation example for ext:news
  • CrawlStrategyFactory to move logic out of the QueueExecutor

Deprecated Functions & Properties

  • ProcessRepository->countActive()
  • ProcessRepository->getLimitFromItemCountAndOffset()
  • CrawlerController->getUrlFromPageAndQueryParameters()

Changed

  • UrlService->getUrlFromPageAndQueryParameters() moved from CrawlerController

Fixed

  • Frontend User initialization with UserGroups for crawling protected pages
  • Make sure PageUids added with ExcludeString are kept as integers
  • Instantiation of ProcessRepository and QueueRepository changed to GeneralUtility::makeInstance
  • Ensure that DataHandlerHook does not add pages that do not exist to the queue

[RELEASE] Crawler version 9.1.0

02 Aug 07:40

Added

  • Light red background color and icons for crawler log rows with errors
  • Crawler processing by page priority
  • Automatically add pages to the crawler queue when they are edited or their page cache is cleared

[RELEASE] Crawler version 9.0.3

02 Aug 07:40

Added

  • More information in CONTRIBUTING.md about development using containers

Fixed

  • Load crawler initialization before TSFE rendering preparation