nutch.git
2 days ago  Sebastian NagelMerge pull request #407 from sebastian-nagel/NUTCH... master
2018-11-19  Sebastian NagelNUTCH-2668 Integrate OWASP dependency checks as ant...
2018-11-19  Sebastian NagelMerge pull request #401 from sebastian-nagel/dependency...
2018-11-19  Sebastian NagelNUTCH-1842: crawl.gen.delay value is read incorrectly...
2018-11-19  Sebastian NagelMerge pull request #392 from sebastian-nagel/NUTCH...
2018-11-15  Sebastian NagelNUTCH-1842: crawl.gen.delay value is read incorrectly...
2018-11-15  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library
2018-11-15  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library
2018-11-15  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library
2018-11-15  Jorge Luis... NUTCH-2658 Adding the fields required by the index...
2018-11-15  Sebastian NagelNUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18)
2018-11-15  Jorge Luis... NUTCH-2661 Move the TestOutlinks class into the o.a...
2018-11-15  Sebastian NagelNUTCH-2660 Plugin tests not executed
2018-11-15  Sebastian NagelNUTCH-2659 Add missing Apache license headers
2018-11-15  Sebastian NagelNUTCH-2655 Update Solr schema.xml for Solr 7.x
2018-11-15  Sebastian NagelNUTCH-2652 Fetcher launches more fetch tasks than fetch...
2018-11-15  Sebastian NagelNUTCH-2651 Upgrade core and parse-tika to use Tika...
2018-11-15  Sebastian NagelNUTCH-2630 Fetcher to log skipped records by robots.txt
2018-11-15  Sebastian NagelNUTCH-2625 ProtocolFactory.getProtocol(url) may create...
2018-11-14  Sebastian NagelMerge pull request #387 from sebastian-nagel/NUTCH...
2018-11-14  Sebastian NagelMerge pull request #395 from sebastian-nagel/NUTCH...
2018-11-11  Jorge Luis... Merge pull request #402 from jorgelbg/index-links-schema
2018-11-08  Sebastian NagelNUTCH-2674 HostDb: dump shows wrong column headers 407/head
2018-10-30  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library
2018-10-30  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library
2018-10-30  Sebastian NagelMerge pull request #406 from sebastian-nagel/NUTCH...
2018-10-29  Sebastian NagelNUTCH-2671 Upgrade to ant ivy library 406/head
2018-10-24  Sebastian NagelNUTCH-2668 Integrate OWASP dependency checks as ant... 401/head
2018-10-23  Jorge Luis... NUTCH-2658 Adding the fields required by the index... 402/head
2018-10-23  Jorge Luis... Merge pull request #396 from sebastian-nagel/NUTCH...
2018-10-23  Jorge Luis... Merge pull request #399 from jorgelbg/indexer-link...
2018-10-21  Sebastian NagelNUTCH-2651 Upgrade to Tika 1.19.1 (from 1.18)
2018-10-20  Sebastian NagelMerge pull request #394 from sebastian-nagel/NUTCH...
2018-10-20  Sebastian NagelMerge pull request #391 from sebastian-nagel/NUTCH...
2018-10-20  Sebastian NagelMerge pull request #397 from sebastian-nagel/NUTCH...
2018-10-20  Sebastian NagelMerge pull request #368 from sebastian-nagel/NUTCH...
2018-10-17  Jorge Luis... NUTCH-2661 Move the TestOutlinks class into the o.a... 399/head
2018-10-17  Sebastian NagelNUTCH-2660 Plugin tests not executed 397/head
2018-10-17  Sebastian NagelNUTCH-2659 Add missing Apache license headers 396/head
2018-10-15  Sebastian NagelNUTCH-2655 Update Solr schema.xml for Solr 7.x 395/head
2018-10-15  Sebastian NagelNUTCH-2652 Fetcher launches more fetch tasks than fetch... 394/head
2018-10-14  YossiTamariNUTCH-1842: crawl.gen.delay value is read incorrectly... 393/head
2018-10-13  Sebastian NagelNUTCH-2606 MIME detection is wrong for plain-text docum... 392/head
2018-10-13  Sebastian NagelMerge pull request #389 from sebastian-nagel/NUTCH...
2018-10-12  Sebastian NagelNUTCH-2651 Upgrade core and parse-tika to use Tika... 391/head
2018-10-10  Sebastian NagelNUTCH-2192 Migrate from Apache ORO to java.util.regex 389/head
2018-10-09  Sebastian NagelNUTCH-1121 JUnit test for parse-js
2018-10-09  Sebastian NagelNUTCH-2192 NUTCH-1678 NUTCH-1014 NUTCH-1021 Migrate...
2018-10-09  Sebastian NagelMerge pull request #388 from sebastian-nagel/NUTCH...
2018-10-08  Sebastian NagelNUTCH-2648 Make configurable whether TLS/SSL certificat... 388/head
2018-10-08  Sebastian NagelNUTCH-2648 Make configurable whether TLS/SSL certificat...
2018-10-08  Sebastian NagelNUTCH-2630 Fetcher to log skipped records by robots.txt 387/head
2018-10-07  Sebastian NagelMerge pull request #369 from sebastian-nagel/NUTCH...
2018-10-07  Sebastian NagelMerge pull request #383 from sebastian-nagel/NUTCH...
2018-10-07  Sebastian NagelMerge pull request #382 from sebastian-nagel/NUTCH...
2018-10-07  Sebastian NagelMerge pull request #376 from sebastian-nagel/NUTCH...
2018-10-07  Sebastian NagelMerge pull request #385 from sebastian-nagel/NUTCH...
2018-09-30  Sebastian NagelNUTCH-2623 Fetcher to guarantee delay for same host... 369/head
2018-09-28  Markus JelsmaNUTCH-2647 Skip TLS certificate checks in protocol...
2018-09-27  Roannel Fernández... Merge pull request #356 from r0ann3l/NUTCH-2602
2018-09-27  r0ann3lMerge branch 'master' into NUTCH-2602 356/head
2018-09-26  Sebastian NagelNUTCH-2642 MoreIndexingFilter parses ISO 8601 UTC dates... 385/head
2018-09-13  Sebastian NagelNUTCH-2645 Webgraph tools ignore command-line options 383/head
2018-09-13  Sebastian NagelProtocolStatusStatistics: job configuration should...
2018-09-13  Sebastian NagelNUTCH-2644 CrawlDbReader -dump ignores filter options
2018-09-12  Sebastian NagelNUTCH-2643 ant target "resolve-default" to depend on... 382/head
2018-09-11  rustyxNUTCH-2639 bin/nutch fails to set native library path...
2018-08-17  Sebastian NagelMerge pull request #365 from sebastian-nagel/NUTCH...
2018-08-17  Sebastian NagelNUTCH-2632 protocol-okhttp doesn't accept proxy authent...
2018-08-17  Sebastian NagelNUTCH-2632 protocol-okhttp doesn't accept proxy authent...
2018-08-17  Lewis John... NUTCH-2633 Fix deprecation warnings when building Nutch...
2018-08-16  Sebastian NagelNUTCH-2635 Generator writes unneeded temporary output 376/head
2018-08-11  Lewis John... NUTCH-2633 Fix deprecation warnings when building Nutch...
2018-08-09  Steven WoodardNUTCH-2632 protocol-okhttp proxy authentication 375/head
2018-08-07  Sebastian NagelPrepare for new development after release of 1.15
2018-07-30  r0ann3lFixes for NUTCH-2602: Description as a table with colum...
2018-07-25  Sebastian NagelMerge pull request #366 from sebastian-nagel/NUTCH...
2018-07-25  Sebastian NagelMerge pull request #367 from sebastian-nagel/NUTCH...
2018-07-24  r0ann3lMerge branch 'master' into NUTCH-2602
2018-07-24  Sebastian NagelNUTCH-2625 ProtocolFactory.getProtocol(url) may create... 368/head
2018-07-24  Sebastian NagelNUTCH-2624 protocol-okhttp resource leak 367/head
2018-07-20  Sebastian NagelNUTCH-2622 Unbundle LGPL-licensed jars from binary... 366/head
2018-07-20  Sebastian NagelNUTCH-2621 Generate report of third-party licenses 365/head
2018-07-19  Sebastian NagelMerge pull request #364 from sebastian-nagel/NUTCH...
2018-07-19  Sebastian NagelMerge pull request #361 from sebastian-nagel/NUTCH...
2018-07-19  Sebastian NagelMerge pull request #355 from sebastian-nagel/NUTCH...
2018-07-19  Sebastian NagelMerge pull request #363 from sebastian-nagel/NUTCH...
2018-07-17  Sebastian NagelNUTCH-1993 Nutch does not use backup parsers 364/head
2018-07-17  Sebastian NagelMerge pull request #359 from sebastian-nagel/NUTCH...
2018-07-17  Sebastian NagelMerge pull request #358 from sebastian-nagel/NUTCH...
2018-07-17  Sebastian NagelNUTCH-2616 Review routing of deletions by Exchange... 363/head
2018-07-16  Sebastian NagelNUTCH-2620 urlfilter-validator incorrectly assumes...
2018-07-13  Gareth Owentypo in fix 362/head
2018-07-13  Gareth OwenFix invalid assumption in URL validator
2018-07-11  Sebastian NagelNUTCH-2071 358/head
2018-07-11  Sebastian NagelNUTCH-2071 A parser failure on a single document
2018-07-11  Sebastian NagelNUTCH-1106 Options to skip url's based on length 359/head
2018-07-11  Sebastian NagelNUTCH-1106 Options to skip url's based on length
2018-07-11  Sebastian NagelNUTCH-2619 protocol-okhttp: allow to keep partially... 361/head
2018-07-11  Sebastian NagelNUTCH-2618 protocol-okhttp not to use http.timeout...
next