heritrix3

[maven-release-plugin] prepare for next development iteration

[maven-release-plugin] prepare release heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] rollback the release of heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] prepare release heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] rollback the release of heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] prepare release heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] rollback the release of heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] prepare for next development iteration

[maven-release-plugin] prepare release heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] rollback the release of heritrix-3.4.0-20200518-NAS-6.0

[maven-release-plugin] prepare for next development iteration

[maven-release-plugin] prepare release heritrix-3.4.0-20200518-NAS-6.0

Set correct release version for h3 3.4.0 for NAS 6.0

Merged latest upstreams for NAS-6.0-SNAPSHOT

Merge remote-tracking branch 'origin/master' into netarkivet-h3-6.0

# Conflicts:
# .gitignore
# commons/pom.xml
# contrib/pom.xml
# dist/pom.xml
# engine/pom.xml
# engine/src/main/java/org/archive/crawler/Heritrix.java
# engine/src/main/java/org/archive/crawler/framework/CrawlJob.java
# engine/src/main/java/org/archive/crawler/framework/Frontier.java
# modules/pom.xml
# modules/src/main/java/org/archive/modules/deciderules/MatchesListRegexDecideRule.java
# modules/src/main/java/org/archive/modules/extractor/ExtractorHTML.java
# modules/src/main/java/org/archive/modules/extractor/HTMLLinkContext.java
# modules/src/test/java/org/archive/modules/deciderules/MatchesListRegexDecideRuleTest.java
# modules/src/test/java/org/archive/modules/extractor/ExtractorHTMLTest.java
# pom.xml

Merge pull request #328 from morokosi/fix-matchesregexdeciderule

Fix match result is always false in MatchesListRegexDecideRule

optimize imports

fix discarded future value

Merge pull request #320 from bnfleb/sftp

Add support for the SFTP protocol

Merge pull request #323 from clawia/master

Add parsing for HTML tags (data-*)

Remove static fields and byte loop + rename variable

rename variable + javadoc

Merge pull request #326 from clawia/crawl-status-and-report

Add real crawlStatus in the crawlReport

Add overrides in JerichoExtractorHTMLTest

Add parsing case data-*

Revert ExtractorHTML

Add missing import

Merge pull request #325 from internetarchive/yt-dl-format

youtube-dl: request best medium-ish size format

Document GET /engine

Add missing newline