nutch-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sna...@apache.org
Subject [nutch] branch master updated (33922fe -> fd31cea)
Date Fri, 22 Feb 2019 14:22:26 GMT
This is an automated email from the ASF dual-hosted git repository.

snagel pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git.


    from 33922fe  NUTCH-2694 HostDB to aggregate by long instead of integer
     new 3abe7db  NUTCH-2695: fix some alerts raised by LGTM - `Wrong NaN comparison` in Generator
- `Type mismatch on container modification` in NutchServerPoolExecutor - `Missing format argument`
in CrawlDbReader
     new 31ecf64  NUTCH-2695: fix some alerts raised by LGTM - useless checks for null in
OutlinkExtractor, protocol-http,   ParsePluginsReader, parse-html, parse-tika - called LOG.equals(...)
instead of LOG.error(...) in ArcRecordReader - constant loop condition in protocol-http -
implicit conversion from array to string in CrawlDbReducer - `Missing format argument` in
log methods - spurious Javadoc @param tags
     new fd31cea  Merge branch 'NUTCH-2695', closes #438

The 2848 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 src/java/org/apache/nutch/crawl/CrawlDbReader.java |  2 +-
 .../org/apache/nutch/crawl/CrawlDbReducer.java     |  4 +-
 src/java/org/apache/nutch/crawl/Generator.java     |  2 +-
 .../apache/nutch/hostdb/UpdateHostDbMapper.java    |  3 --
 .../apache/nutch/hostdb/UpdateHostDbReducer.java   |  2 -
 .../nutch/indexer/IndexingFiltersChecker.java      |  2 +-
 .../org/apache/nutch/parse/OutlinkExtractor.java   |  2 +-
 src/java/org/apache/nutch/parse/ParseData.java     | 18 ++-------
 .../org/apache/nutch/parse/ParsePluginsReader.java |  2 +-
 .../org/apache/nutch/segment/SegmentMerger.java    |  4 +-
 .../org/apache/nutch/service/impl/LinkReader.java  |  8 ++--
 .../org/apache/nutch/service/impl/NodeReader.java  |  8 ++--
 .../service/impl/NutchServerPoolExecutor.java      |  2 +-
 .../apache/nutch/service/impl/SequenceReader.java  |  8 ++--
 .../org/apache/nutch/tools/arc/ArcInputFormat.java |  4 +-
 .../apache/nutch/tools/arc/ArcRecordReader.java    |  2 +-
 .../apache/nutch/tools/arc/ArcSegmentCreator.java  |  4 +-
 .../org/apache/nutch/util/EncodingDetector.java    |  6 +--
 src/java/org/apache/nutch/util/MimeUtil.java       |  3 +-
 .../apache/nutch/parse/html/HTMLMetaProcessor.java | 45 ++++++++++------------
 .../apache/nutch/parse/tika/HTMLMetaProcessor.java | 45 ++++++++++------------
 .../apache/nutch/protocol/http/HttpResponse.java   |  4 +-
 .../nutch/scoring/orphan/OrphanScoringFilter.java  | 12 ++++--
 23 files changed, 87 insertions(+), 105 deletions(-)


Mime
View raw message