nutch-dev mailing list archives

From dyzsasd <...@git.apache.org>
Subject [GitHub] nutch pull request: Branch 2.3.1
Date Mon, 12 Oct 2015 15:53:25 GMT
GitHub user dyzsasd opened a pull request:

    https://github.com/apache/nutch/pull/72

    Branch 2.3.1

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/apache/nutch branch-2.3.1

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/72.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #72
    
----
commit fa88ac21de22536c7bd464d59204d8fbf034aa53
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-06-27T17:21:35Z

    prepare for new development
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1497462 13f79535-47bb-0310-9956-ffa450edef68

commit 9728ed2267e359772c6e8aa61f0bde69b7237f2d
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-06-27T18:01:56Z

    update for release report
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1497480 13f79535-47bb-0310-9956-ffa450edef68

commit e868ed8d22f0ff69f7fa0da60269d09f30698469
Author: lufeng <fenglu@apache.org>
Date:   2013-07-01T13:34:23Z

    NUTCH-1594 count variable is never changed in ParseUtil class 
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1498437 13f79535-47bb-0310-9956-ffa450edef68

commit fe9ea2aad1e75e419048d454992a5a56ceac8a1d
Author: Markus Jelsma <markus@apache.org>
Date:   2013-07-05T10:27:47Z

    NUTCH-1595 Upgrade to Tika 1.4 (jnioche, markus)
    
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1499959 13f79535-47bb-0310-9956-ffa450edef68

commit d5cb787bead9589df0fe4f896fbb2ed17f059d9c
Author: Julien Nioche <jnioche@apache.org>
Date:   2013-07-08T08:50:08Z

    NUTCH-1604 Protocol-factory not thread-safe
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1500610 13f79535-47bb-0310-9956-ffa450edef68

commit ccd793cd35768377231d77c01c5e9a9b700694f1
Author: Sebastian Nagel <snagel@apache.org>
Date:   2013-07-25T21:15:02Z

    NUTCH-1587 misspelled property "threshold" in conf/log4j.properties
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1507131 13f79535-47bb-0310-9956-ffa450edef68

commit d4deef989ffc41b9dd5e77683e73286d81e1178b
Author: Sebastian Nagel <snagel@apache.org>
Date:   2013-08-07T21:10:17Z

    NUTCH-911 protocol-file to return proper protocol status for notmodified, gone, access_denied
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1511496 13f79535-47bb-0310-9956-ffa450edef68

commit 46dae3c0f754f212f7260d897bbd0785c19cd418
Author: lufeng <fenglu@apache.org>
Date:   2013-08-13T15:17:05Z

    NUTCH-1294 IndexClean job with solr implementation.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1513543 13f79535-47bb-0310-9956-ffa450edef68

commit f7a76daaeb0c0f3686ececb1d946529f28f6ff17
Author: lufeng <fenglu@apache.org>
Date:   2013-08-13T15:21:34Z

    NUTCH-1294 IndexClean job with solr implementation.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1513548 13f79535-47bb-0310-9956-ffa450edef68

commit 0508944f9bfbbf5f6b6898a95d156d2977ab3137
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-08-18T23:02:53Z

    NUTCH-1624 Typo in WebTableReader line 486
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1515240 13f79535-47bb-0310-9956-ffa450edef68

commit 86c1f5584a49d45ac1d150a8dafedbd2af7351c1
Author: Julien Nioche <jnioche@apache.org>
Date:   2013-08-23T08:52:38Z

    NUTCH-1629 Injector skips empty lines
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1516752 13f79535-47bb-0310-9956-ffa450edef68

commit 936389646645b84816579f30c96077a678de5b1c
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-08-23T19:47:16Z

    NUTCH-1631 Display Document Count Added to Solr Server
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517003 13f79535-47bb-0310-9956-ffa450edef68

commit 33bed204bb922e9d5b3f3d67f2b61757ce3fdd9e
Author: lufeng <fenglu@apache.org>
Date:   2013-08-24T15:21:20Z

    NUTCH-1619 Writes Dmoz Description and Title information to db with snippet argument.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517147 13f79535-47bb-0310-9956-ffa450edef68

commit a0030f4ef10f2866ccae90afadc8f3460911f88d
Author: lufeng <fenglu@apache.org>
Date:   2013-08-24T15:50:01Z

    NUTCH-1619 Writes Dmoz Description and Title information to db with snippet argument.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1517155 13f79535-47bb-0310-9956-ffa450edef68

commit 1d62b185abbd6f98c3dd644861bfb44d036bde8a
Author: lufeng <fenglu@apache.org>
Date:   2013-09-05T14:40:25Z

    NUTCH-1556 enabling updatedb to accept batchId
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1520332 13f79535-47bb-0310-9956-ffa450edef68

commit 3a0eb5bdcb2a3ab14c4cf1093e50e3e5dc5ffd8b
Author: lufeng <fenglu@apache.org>
Date:   2013-09-12T13:23:24Z

    NUTCH-1556 enabling updatedb to accept batchId
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1522566 13f79535-47bb-0310-9956-ffa450edef68

commit 3a63fe35fb5c1e07d061a33980070110b30660cd
Author: Julien Nioche <jnioche@apache.org>
Date:   2013-09-20T08:03:24Z

    NUTCH-1641 Log timings for main jobs
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1524931 13f79535-47bb-0310-9956-ffa450edef68

commit 25d97ee80cf5815bba35ff929619e5f00f74d39b
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-10-27T11:54:36Z

    NUTCH-1124 JUnit tests for OPIC Scoring
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1536106 13f79535-47bb-0310-9956-ffa450edef68

commit 0162aef1ef287292394a2ef078381dbb0f73a659
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-01T18:44:23Z

    NUTCH-1125 JUnit test for TLD
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538023 13f79535-47bb-0310-9956-ffa450edef68

commit 9bfd1fdd00aee4cb6f5c049e305bde6d3917f573
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-02T14:03:57Z

    NUTCH-1413 Record response time
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538193 13f79535-47bb-0310-9956-ffa450edef68

commit 295ea6bf338a8fd762ca6ba855011bd966222aba
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-02T14:11:04Z

    NUTCH-1650 Adaptive Fetch Scheduler interval Wrong Set
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538195 13f79535-47bb-0310-9956-ffa450edef68

commit de47b8e2e39c250b9cf2c76070b57aa739494d3a
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-02T14:16:28Z

    NUTCH-1588 Port NUTCH-1245 URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again to 2.x
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538200 13f79535-47bb-0310-9956-ffa450edef68

commit 1b15606816bf76dca22df7ff644e36db0e145eb6
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-02T20:52:19Z

    NUTCH-1360 Suport the storing of IP address connected to when web crawling
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538280 13f79535-47bb-0310-9956-ffa450edef68

commit 0429d858a2379cac33fb8f64a39e6d9c0fce5d02
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-04T19:11:16Z

    NUTCH-1651 modifiedTime and prevmodifiedTime never set
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1538723 13f79535-47bb-0310-9956-ffa450edef68

commit 01d5123ace23143974d1b9b5d364764c6c073b93
Author: Julien Nioche <jnioche@apache.org>
Date:   2013-11-14T12:12:32Z

    Removed all in one Crawl class (NUTCH-1621)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1541886 13f79535-47bb-0310-9956-ffa450edef68

commit 38aa2dc51a869215aac52ead46274da582635a37
Author: Julien Nioche <jnioche@apache.org>
Date:   2013-11-15T09:20:03Z

    Removed all in one Crawl class (NUTCH-1621)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1542208 13f79535-47bb-0310-9956-ffa450edef68

commit c96a55308d639de95083f090494f0a4a36be54e0
Author: Sebastian Nagel <snagel@apache.org>
Date:   2013-11-21T22:04:13Z

    NUTCH-1587 misspelled property "threshold" in log4j.properties
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1544341 13f79535-47bb-0310-9956-ffa450edef68

commit 7232641bba876a1b423061331bb047be2c4cbf2a
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-11-27T10:14:18Z

    NUTCH-1673 Title isn't reset in MoreIndexingFilter
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1545982 13f79535-47bb-0310-9956-ffa450edef68

commit 0ab335e9f73194e30dcd5d2065996853067b42f1
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-12-23T15:06:41Z

    NUTCH-1681 In URLUtil.java, toUNICODE method does not work correctly
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1553125 13f79535-47bb-0310-9956-ffa450edef68

commit f6cd10fa70e757e53aef8be8c179ad638dd73e94
Author: Lewis John McGibbney <lewismc@apache.org>
Date:   2013-12-23T17:17:53Z

    NUTCH-1360 Support the storing of IP address connected to when web crawling
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1553154 13f79535-47bb-0310-9956-ffa450edef68

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---
