manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Karl Wright <daddy...@gmail.com>
Subject Re: Web crawling causes Socket Timeout after Database Exception
Date Mon, 10 Dec 2012 07:44:55 GMT
I've checked in a fix for this problem.

Karl

On Mon, Dec 10, 2012 at 1:27 AM, Shigeki Kobayashi
<shigeki.kobayashi3@g.softbank.co.jp> wrote:
> Hi Karl.
>
>
> I ran MCF1.1dev downloaded at Nov, 29th. As I was crawling files on Windows
> file server,
> there was Database Exception as following:
>
> -----------------------
> 012/12/07 23:08:14 ERROR (Stuffer thread) - Stuffer thread aborting and
> restarting due to database connection reset: Database exception:
> SQLException doing query (41000): Lock wait timeout exceeded; try restarting
> transaction
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Database
> exception: SQLException doing query (41000): Lock wait timeout exceeded; try
> restarting transaction
>         at
> org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>         at
> org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>         at
> org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>         at
> org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>         at
> org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>         at
> org.apache.manifoldcf.core.database.DBInterfaceMySQL.performQuery(DBInterfaceMySQL.java:882)
>         at
> org.apache.manifoldcf.crawler.jobs.JobManager.fetchAndProcessDocuments(JobManager.java:2260)
>         at
> org.apache.manifoldcf.crawler.jobs.JobManager.getNextDocuments(JobManager.java:2066)
>         at
> org.apache.manifoldcf.crawler.system.StufferThread.run(StufferThread.java:157)
> Caused by: java.sql.SQLException: Lock wait timeout exceeded; try restarting
> transaction
>         at com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>         at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>         at com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)
>         at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)
>         at com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)
>         at com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>         at
> com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>         at
> com.mysql.jdbc.PreparedStatement.executeQuery(PreparedStatement.java:2293)
>         at
> org.apache.manifoldcf.core.database.Database.execute(Database.java:826)
>         at
> org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
> -----------------------
>
> I do not think the job was stopped at this moment, but since the log above
> was output into the log file,
> I am wondering if the exception was handled correctly.
>
>
> I run MCF using MySQL5.5.23.
>
> Regards,
>
> Shigeki
>
>
> 2012/11/30 Karl Wright <daddywri@gmail.com>
>>
>> Hi Shigeki,
>>
>> I noticed that your crawl is using hopcount filtering.  This feature
>> is costly performance-wise.  If you can crawl with hopcount filtering
>> disabled, your crawl will be much faster.
>>
>> To disable completely, select the radio button titled
>> "読込めないコンテンツ情報は永久保存", and leave the hopcount fields blank.
>>
>> Thanks,
>> Karl
>>
>> On Fri, Nov 30, 2012 at 1:57 AM, Shigeki Kobayashi
>> <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> > Hi, Karl
>> >
>> >
>> > I think handling MySQL exception keeps MCF crawling contents. However,
>> > because of deadlocks, crawling speed would be remained slow. I think the
>> > fundamental solution of the problem is to reduce deadlocks in MySQL. I
>> > am
>> > not sure if this could be solved by MCF but this is a task that people
>> > using
>> > MySQL  need to know.
>> >
>> >
>> > Regards,
>> >
>> >
>> > Shigeki
>> >
>> >
>> > 2012/11/28 Karl Wright <daddywri@gmail.com>
>> >>
>> >> Yes, the SQL code will be output to the manifoldcf.log as part of the
>> >> exception text.
>> >>
>> >> However I hope that this checkin will already fix your problem.
>> >>
>> >> Thanks,
>> >> Karl
>> >>
>> >> On Wed, Nov 28, 2012 at 3:44 AM, Shigeki Kobayashi
>> >> <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> >> >
>> >> > Hi Karl,
>> >> >
>> >> > I can try. To obtain the error code, could you let me know what to
>> >> > code
>> >> > to
>> >> > put in what line of what file? I suppose the error code will be
>> >> > output
>> >> > into
>> >> > manifoldcf.log, is this right?
>> >> >
>> >> >
>> >> > Regards,
>> >> >
>> >> >
>> >> > Shigeki
>> >> >
>> >> >
>> >> >
>> >> > 2012/11/28 Karl Wright <daddywri@gmail.com>
>> >> >>
>> >> >> Hi Shigeki,
>> >> >>
>> >> >> This confirms my theory that our MySQL driver is not detecting all
>> >> >> cases where MySQL gives up on a transaction.  We need to correct
>> >> >> this,
>> >> >> but in order to do that we need the SQL error code that MySQL throws
>> >> >> in this case:
>> >> >>
>> >> >> Caused by: java.sql.SQLException: Lock wait timeout exceeded; try
>> >> >> restarting transaction
>> >> >>
>> >> >> It looks like somebody actually posted the SQL error code that MYSQL
>> >> >> sends out with this online:
>> >> >>
>> >> >> ERROR 1205 (HY000): Lock wait timeout exceeded; try restarting
>> >> >> transaction
>> >> >>
>> >> >> Are you able to build ManifoldCF?  I will check in a fix to trunk
>> >> >> for
>> >> >> this problem shortly; it would be great if you could try it out.
>> >> >>
>> >> >> Thanks,
>> >> >> Karl
>> >> >>
>> >> >> On Wed, Nov 28, 2012 at 2:30 AM, Shigeki Kobayashi
>> >> >> <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> >> >> > Hi Karl,
>> >> >> >
>> >> >> >
>> >> >> > Here is a log of Database Exception that is occurred while
>> >> >> > crawling
>> >> >> > Web.
>> >> >> > This time, socket timeout exception did not happen so it might be
>> >> >> > a
>> >> >> > different matter.
>> >> >> > Even though the job status remain "Running", it seems that MCF
>> >> >> > stopped
>> >> >> > crawling (The job was not aborted).
>> >> >> > --------------------------------
>> >> >> > ERROR 2012-11-22 19:36:28,593 (Worker thread '16') - Worker thread
>> >> >> > aborting
>> >> >> > and restarting due to database connection reset: Database
>> >> >> > exception:
>> >> >> > Exception doing query: Lock wait timeout exceeded; try restarting
>> >> >> > transaction
>> >> >> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> > Database
>> >> >> > exception: Exception doing query: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performModification(DBInterfaceMySQL.java:678)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performUpdate(DBInterfaceMySQL.java:275)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.BaseTable.performUpdate(BaseTable.java:80)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.HopCount.markForDelete(HopCount.java:1426)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.HopCount.doDeleteInvalidation(HopCount.java:1356)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.HopCount.doFinish(HopCount.java:1057)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.HopCount.finishParents(HopCount.java:389)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.JobManager.finishDocuments(JobManager.java:4309)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:557)
>> >> >> > Caused by: java.sql.SQLException: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)
>> >> >> >         at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2427)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2345)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2330)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.execute(Database.java:840)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> >
>> >> >> > --------------------------------
>> >> >> >
>> >> >> >
>> >> >> > Here is a log of Database Exception that is occurred while
>> >> >> > crawling
>> >> >> > files
>> >> >> > using Windows shares connection:
>> >> >> >
>> >> >> >
>> >> >> > --------------------------------
>> >> >> > 2012/11/22 23:39:28 ERROR (Job start thread) - Job start thread
>> >> >> > aborting
>> >> >> > and
>> >> >> > restarting due to database connection reset: Database exception:
>> >> >> > Exception
>> >> >> > doing query: Lock wait timeout exceeded; try restarting
>> >> >> > transaction
>> >> >> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> > Database
>> >> >> > exception: Exception doing query: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performQuery(DBInterfaceMySQL.java:852)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.JobManager.startJobs(JobManager.java:4711)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.system.JobStartThread.run(JobStartThread.java:68)
>> >> >> > Caused by: java.sql.SQLException: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >         at com.mysql.jdbc.MysqlIO.nextRowFast(MysqlIO.java:1578)
>> >> >> >         at com.mysql.jdbc.MysqlIO.nextRow(MysqlIO.java:1434)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.readSingleRowSet(MysqlIO.java:2925)
>> >> >> >         at com.mysql.jdbc.MysqlIO.getResultSet(MysqlIO.java:477)
>> >> >> >         at
>> >> >> >
>> >> >> > com.mysql.jdbc.MysqlIO.readResultsForQueryOrUpdate(MysqlIO.java:2631)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.readAllResults(MysqlIO.java:1800)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2221)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeQuery(PreparedStatement.java:2293)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.execute(Database.java:826)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> > 2012/11/22 23:39:28 ERROR (Finisher thread) - Finisher thread
>> >> >> > aborting
>> >> >> > and
>> >> >> > restarting due to database connection reset: Database exception:
>> >> >> > Exception
>> >> >> > doing query: Lock wait timeout exceeded; try restarting
>> >> >> > transaction
>> >> >> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> > Database
>> >> >> > exception: Exception doing query: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performQuery(DBInterfaceMySQL.java:852)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.JobManager.finishJobs(JobManager.java:6469)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.system.FinisherThread.run(FinisherThread.java:64)
>> >> >> > Caused by: java.sql.SQLException: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >         at com.mysql.jdbc.MysqlIO.nextRowFast(MysqlIO.java:1578)
>> >> >> >         at com.mysql.jdbc.MysqlIO.nextRow(MysqlIO.java:1434)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.readSingleRowSet(MysqlIO.java:2925)
>> >> >> >         at com.mysql.jdbc.MysqlIO.getResultSet(MysqlIO.java:477)
>> >> >> >         at
>> >> >> >
>> >> >> > com.mysql.jdbc.MysqlIO.readResultsForQueryOrUpdate(MysqlIO.java:2631)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.readAllResults(MysqlIO.java:1800)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2221)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeQuery(PreparedStatement.java:2293)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.execute(Database.java:826)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> > 2012/11/22 23:39:30 ERROR (Worker thread '253') - Worker thread
>> >> >> > aborting
>> >> >> > and
>> >> >> > restarting due to database connection reset: Database exception:
>> >> >> > Exception
>> >> >> > doing query: Lock wait timeout exceeded; try restarting
>> >> >> > transaction
>> >> >> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> > Database
>> >> >> > exception: Exception doing query: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performModification(DBInterfaceMySQL.java:678)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performUpdate(DBInterfaceMySQL.java:275)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.BaseTable.performUpdate(BaseTable.java:80)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.JobQueue.updateCompletedRecord(JobQueue.java:722)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.jobs.JobManager.markDocumentCompletedMultiple(JobManager.java:2435)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:765)
>> >> >> > Caused by: java.sql.SQLException: Lock wait timeout exceeded; try
>> >> >> > restarting
>> >> >> > transaction
>> >> >> >         at
>> >> >> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)
>> >> >> >         at com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)
>> >> >> >         at
>> >> >> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2427)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2345)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > com.mysql.jdbc.PreparedStatement.executeUpdate(PreparedStatement.java:2330)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database.execute(Database.java:840)
>> >> >> >         at
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> > --------------------------------
>> >> >> > Regards,
>> >> >> >
>> >> >> > Shigeki
>> >> >> >
>> >> >> >
>> >> >> > 2012/11/27 Karl Wright <daddywri@gmail.com>
>> >> >> >>
>> >> >> >> Hi Shigeki,
>> >> >> >>
>> >> >> >> Deadlocks are a fact of life in a very multithreaded application.
>> >> >> >> They are supposed to be caught by ManifoldCF, and the
>> >> >> >> transactions
>> >> >> >> retried.  I can believe, though, that MySQL might set different
>> >> >> >> sqlexception status codes for different kinds of deadlock - if
>> >> >> >> instead
>> >> >> >> of a sqlexception with a deadlock code, MySQL sometimes just
>> >> >> >> drops
>> >> >> >> the
>> >> >> >> JDBC connection, that might explain the problem.
>> >> >> >>
>> >> >> >> Can you refresh my memory and please send the ManifoldCF log part
>> >> >> >> that
>> >> >> >> includes the socket timeout exception?  I can then see if it is
>> >> >> >> coming
>> >> >> >> from the same place.
>> >> >> >>
>> >> >> >> Karl
>> >> >> >>
>> >> >> >> On Tue, Nov 27, 2012 at 12:50 AM, Shigeki Kobayashi
>> >> >> >> <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> >> >> >> > Hi Karl,
>> >> >> >> >
>> >> >> >> > According to INNODB STATUS in MySQL, while crawling web, the
>> >> >> >> > following
>> >> >> >> > DEADLOCK occurred.
>> >> >> >> > A few minutes later, database exception occurred in MCF.
>> >> >> >> > So do you think probably this DEADLOCK could cause the
>> >> >> >> > exception?
>> >> >> >> >
>> >> >> >> > I do not know the error code from MySQL yet, but maybe I could
>> >> >> >> > obtain
>> >> >> >> > it
>> >> >> >> > if
>> >> >> >> > you could let me
>> >> >> >> > know what code in what file should be added in order to output
>> >> >> >> > the
>> >> >> >> > error
>> >> >> >> > code into manifold.log
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > ------------------------------------------------------------------------------
>> >> >> >> > INNODB STATUS:
>> >> >> >> > =====================================
>> >> >> >> > 121127 11:17:49 INNODB MONITOR OUTPUT
>> >> >> >> > =====================================
>> >> >> >> > Per second averages calculated from the last 60 seconds
>> >> >> >> > -----------------
>> >> >> >> > BACKGROUND THREAD
>> >> >> >> > -----------------
>> >> >> >> > srv_master_thread loops: 401163 1_second, 401162 sleeps, 40049
>> >> >> >> > 10_second,
>> >> >> >> > 674 background, 674 flush
>> >> >> >> > srv_master_thread log flush and writes: 401182
>> >> >> >> > ----------
>> >> >> >> > SEMAPHORES
>> >> >> >> > ----------
>> >> >> >> > OS WAIT ARRAY INFO: reservation count 7319, signal count 6842
>> >> >> >> > Mutex spin waits 3702, rounds 111120, OS waits 3626
>> >> >> >> > RW-shared spins 2189, rounds 63516, OS waits 1767
>> >> >> >> > RW-excl spins 255, rounds 57147, OS waits 1897
>> >> >> >> > Spin rounds per wait: 30.02 mutex, 29.02 RW-shared, 224.11
>> >> >> >> > RW-excl
>> >> >> >> > ------------------------
>> >> >> >> > LATEST DETECTED DEADLOCK
>> >> >> >> > ------------------------
>> >> >> >> > 121122 19:31:55
>> >> >> >> > *** (1) TRANSACTION:
>> >> >> >> > TRANSACTION 3021A0, ACTIVE 32 sec starting index read
>> >> >> >> > mysql tables in use 1, locked 1
>> >> >> >> > LOCK WAIT 64 lock struct(s), heap size 14776, 110 row lock(s),
>> >> >> >> > undo
>> >> >> >> > log
>> >> >> >> > entries 51
>> >> >> >> > MySQL thread id 24, OS thread handle 0x7ff8ffe06700, query id
>> >> >> >> > 41385
>> >> >> >> > 10.249.23.9 manifoldcf Sending data
>> >> >> >> > SELECT parentidhash,linktype,distance FROM hopcount WHERE
>> >> >> >> > jobid=1351139121625 AND parentidhash IN
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > ('A0ED08F9D45547FF54B72869FE5E7C3C5B0E910A','F5E2F6C6B43FB5D030C4F0AE8E22AD07536475A8','C0856A1AFF55F7BB20BCAE317E18F588EEFB806D','019253D99FCB265A20A3CFF11D0443937FE2D4D0','21A979F9BB9120F747B0B605EDABA71EB364A584','A8B5B7245D0810584B764470B42CFDF71C33A7E5','6FE272988943D3BD64E285951A1A6739011FC15E','1E1CA954A3E31BFC28FFE1BE70757408341CDB6A','8DAE8B4734A30FE2D346EEBD1CCC3A16468F7B7B','028CD3E7FF7F493E3EC3980FF303DB05DC42404E','924E0608A5C4505C9272A69B8C1F82C7B883A11F','13F6402C96E0979EF5F17338DFF96BD9912125D9','43174E34AA07C34237D622A43A82AFE3825C3870','32892282A6866BD181BDA0BA85801192370C84F3','0311197289655163E1452E90D43A5D96D9A4E751','178C8BE84AEDC9F362CE3A2CC2702F6C2CD9CBA1','7CF15B193B3BDA097BEB437272FC5E413B86B63D')
>> >> >> >> > AND linktype IN ('link','redirect')
>> >> >> >> > *** (1) WAITING FOR THIS LOCK TO BE GRANTED:
>> >> >> >> > RECORD LOCKS space id 0 page no 2449 n bits 192 index `PRIMARY`
>> >> >> >> > of
>> >> >> >> > table
>> >> >> >> > `manifoldcf`.`hopcount` trx id 3021A0 lock mode S locks rec but
>> >> >> >> > not
>> >> >> >> > gap
>> >> >> >> > waiting
>> >> >> >> > Record lock, heap no 28 PHYSICAL RECORD: n_fields 8; compact
>> >> >> >> > format;
>> >> >> >> > info
>> >> >> >> > bits 0
>> >> >> >> >  0: len 8; hex 8000013b261c6d8a; asc    ;& m ;;
>> >> >> >> >  1: len 6; hex 0000002f4e47; asc    /NG;;
>> >> >> >> >  2: len 7; hex 900000021b06ec; asc        ;;
>> >> >> >> >  3: len 1; hex 4e; asc N;;
>> >> >> >> >  4: len 30; hex
>> >> >> >> > 384441453842343733344133304645324433343645454244314343433341;
>> >> >> >> > asc
>> >> >> >> > 8DAE8B4734A30FE2D346EEBD1CCC3A; (total 40 bytes);
>> >> >> >> >  5: len 8; hex 8000000000000001; asc         ;;
>> >> >> >> >  6: len 8; hex 8000013a962ad9d9; asc    : *  ;;
>> >> >> >> >  7: len 4; hex 6c696e6b; asc link;;
>> >> >> >> >
>> >> >> >> > *** (2) TRANSACTION:
>> >> >> >> > TRANSACTION 302208, ACTIVE 3 sec fetching rows
>> >> >> >> > mysql tables in use 3, locked 3
>> >> >> >> > 1436 lock struct(s), heap size 145848, 122906 row lock(s)
>> >> >> >> > MySQL thread id 39, OS thread handle 0x7ff8ffa37700, query id
>> >> >> >> > 40699
>> >> >> >> > 10.249.23.9 manifoldcf preparing
>> >> >> >> > UPDATE hopcount SET deathmark='D',distance=-1 WHERE id
>> >> >> >> > IN(SELECT
>> >> >> >> > ownerid
>> >> >> >> > FROM hopdeletedeps t0 WHERE t0.jobid=1351139121625 AND
>> >> >> >> > t0.childidhash='D573BDC6D59C7A7CC2862646322F69EA5574C36D' AND
>> >> >> >> > EXISTS(SELECT
>> >> >> >> > 'x' FROM intrinsiclink t1 WHERE t1.jobid=t0.jobid AND
>> >> >> >> > t1.linktype=t0.linktype AND t1.parentidhash=t0.parentidhash AND
>> >> >> >> > t1.childidhash=t0.childidhash AND t1.isnew='B'))
>> >> >> >> > *** (2) HOLDS THE LOCK(S):
>> >> >> >> > RECORD LOCKS space id 0 page no 2449 n bits 192 index `PRIMARY`
>> >> >> >> > of
>> >> >> >> > table
>> >> >> >> > `manifoldcf`.`hopcount` trx id 302208 lock_mode X
>> >> >> >> > Record lock, heap no 1 PHYSICAL RECORD: n_fields 1; compact
>> >> >> >> > format;
>> >> >> >> > info
>> >> >> >> > bits 0
>> >> >> >> >  0: len 8; hex 73757072656d756d; asc supremum;;
>> >> >> >> >
>> >> >> >> > ...
>> >> >> >> > ...
>> >> >> >> > ...
>> >> >> >> >
>> >> >> >> > *** WE ROLL BACK TRANSACTION (1)
>> >> >> >> > ------------
>> >> >> >> > TRANSACTIONS
>> >> >> >> > ------------
>> >> >> >> > Trx id counter 38375F
>> >> >> >> > Purge done for trx's n:o < 3024F7 undo n:o < 0
>> >> >> >> > History list length 652
>> >> >> >> > LIST OF TRANSACTIONS FOR EACH SESSION:
>> >> >> >> > ---TRANSACTION 0, not started
>> >> >> >> > MySQL thread id 110, OS thread handle 0x7ff914113700, query id
>> >> >> >> > 1436936
>> >> >> >> > localhost root
>> >> >> >> > SHOW ENGINE INNODB STATUS
>> >> >> >> > ---TRANSACTION 0, not started
>> >> >> >> > MySQL thread id 106, OS thread handle 0x7ff9035b3700, query id
>> >> >> >> > 1435785
>> >> >> >> > localhost root
>> >> >> >> > ---TRANSACTION 38375E, not started
>> >> >> >> > MySQL thread id 99, OS thread handle 0x7ff8ff72b700, query id
>> >> >> >> > 1436934
>> >> >> >> > 10.249.23.9 manifoldcf
>> >> >> >> > --------
>> >> >> >> > ...
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > ------------------------------------------------------------------------------
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > Likewise, file crawling using Windows shares faced similar
>> >> >> >> > matter.
>> >> >> >> > DEADLOCK
>> >> >> >> > occured in MySQL and
>> >> >> >> > Database Exception occured in MCF as well:
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > ------------------------------------------------------------------------------
>> >> >> >> >
>> >> >> >> > | InnoDB |      |
>> >> >> >> > =====================================
>> >> >> >> > 121126 16:05:21 INNODB MONITOR OUTPUT
>> >> >> >> > =====================================
>> >> >> >> > Per second averages calculated from the last 48 seconds
>> >> >> >> > -----------------
>> >> >> >> > BACKGROUND THREAD
>> >> >> >> > -----------------
>> >> >> >> > srv_master_thread loops: 327427 1_second, 327300 sleeps, 32438
>> >> >> >> > 10_second,
>> >> >> >> > 3544 background, 3544 flush
>> >> >> >> > srv_master_thread log flush and writes: 327670
>> >> >> >> > ----------
>> >> >> >> > SEMAPHORES
>> >> >> >> > ----------
>> >> >> >> > OS WAIT ARRAY INFO: reservation count 1808090, signal count
>> >> >> >> > 2140762
>> >> >> >> > Mutex spin waits 18194682, rounds 103331992, OS waits 842070
>> >> >> >> > RW-shared spins 1311114, rounds 25796436, OS waits 457767
>> >> >> >> > RW-excl spins 577964, rounds 15904805, OS waits 333210
>> >> >> >> > Spin rounds per wait: 5.68 mutex, 19.68 RW-shared, 27.52
>> >> >> >> > RW-excl
>> >> >> >> > ------------------------
>> >> >> >> > LATEST DETECTED DEADLOCK
>> >> >> >> > ------------------------
>> >> >> >> > 121122 23:38:46
>> >> >> >> > *** (1) TRANSACTION:
>> >> >> >> > TRANSACTION 674749, ACTIVE 7 sec inserting
>> >> >> >> > mysql tables in use 1, locked 1
>> >> >> >> > LOCK WAIT 4 lock struct(s), heap size 1248, 3 row lock(s), undo
>> >> >> >> > log
>> >> >> >> > entries
>> >> >> >> > 1
>> >> >> >> > MySQL thread id 99, OS thread handle 0x7f7d4a356700, query id
>> >> >> >> > 23942404
>> >> >> >> > localhost 127.0.0.1 manifoldcf update
>> >> >> >> > INSERT INTO jobqueue
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > (docpriority,id,priorityset,docid,status,dochash,checktime,checkaction,jobid)
>> >> >> >> > VALUES
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > (13.830866056523654,1353595119848,1353595119385,'smb://xxx/xxx','P','88517951DB2E0666151E7B5308C9FDCB16F062AD',0,'R',1353575409046)
>> >> >> >> > *** (1) WAITING FOR THIS LOCK TO BE GRANTED:
>> >> >> >> > RECORD LOCKS space id 0 page no 221894 n bits 208 index
>> >> >> >> > `I1352346865065`
>> >> >> >> > of
>> >> >> >> > table `manifoldcf`.`jobqueue` trx id 674749 lock_mode X locks
>> >> >> >> > gap
>> >> >> >> > before
>> >> >> >> > rec
>> >> >> >> > insert intention waiting
>> >> >> >> > Record lock, heap no 134 PHYSICAL RECORD: n_fields 3; compact
>> >> >> >> > format;
>> >> >> >> > info
>> >> >> >> > bits 0
>> >> >> >> >  0: len 30; hex
>> >> >> >> > 383835313838444433453134444134354242384531383433424330393444;
>> >> >> >> > asc
>> >> >> >> > 885188DD3E14DA45BB8E1843BC094D; (total 40 bytes);
>> >> >> >> >  1: len 8; hex 8000013b2761a596; asc    ;'a  ;;
>> >> >> >> >  2: len 8; hex 8000013b287bd5c9; asc    ;({  ;;
>> >> >> >> >
>> >> >> >> > *** (2) TRANSACTION:
>> >> >> >> > TRANSACTION 6740DF, ACTIVE 9 sec fetching rows
>> >> >> >> > mysql tables in use 5, locked 5
>> >> >> >> > 23571 lock struct(s), heap size 2439608, 1058037 row lock(s)
>> >> >> >> > MySQL thread id 45, OS thread handle 0x7f7d21231700, query id
>> >> >> >> > 23937374
>> >> >> >> > localhost 127.0.0.1 manifoldcf Sending data
>> >> >> >> > SELECT
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > t0.id,t0.jobid,t0.dochash,t0.docid,t0.status,t0.failtime,t0.failcount,t0.priorityset
>> >> >> >> > FROM jobqueue t0 WHERE t0.status IN ('P','G') AND
>> >> >> >> > t0.checkaction='R'
>> >> >> >> > AND
>> >> >> >> > t0.checktime<=1353595117855 AND EXISTS(SELECT 'x' FROM jobs t1
>> >> >> >> > WHERE
>> >> >> >> > t1.status IN ('A','a') AND t1.id=t0.jobid AND t1.priority=5)
>> >> >> >> > AND
>> >> >> >> > NOT
>> >> >> >> > EXISTS(SELECT 'x' FROM jobqueue t2 WHERE t2.dochash=t0.dochash
>> >> >> >> > AND
>> >> >> >> > t2.status
>> >> >> >> > IN ('A','F','a','f','D','d') AND t2.jobid!=t0.jobid) AND NOT
>> >> >> >> > EXISTS(SELECT
>> >> >> >> > 'x' FROM prereqevents t3,events t4 WHERE t0.id=t3.owner AND
>> >> >> >> > t3.eventname=t4.name) ORDER BY t0.docpriority ASC,t0.status
>> >> >> >> > ASC,t0.checkaction ASC,t0.checktime ASC LIMIT 1200
>> >> >> >> > *** (2) HOLDS THE LOCK(S):
>> >> >> >> > RECORD LOCKS space id 0 page no 221894 n bits 208 index
>> >> >> >> > `I1352346865065`
>> >> >> >> > of
>> >> >> >> > table `manifoldcf`.`jobqueue` trx id 6740DF lock mode S locks
>> >> >> >> > gap
>> >> >> >> > before
>> >> >> >> > rec
>> >> >> >> > Record lock, heap no 8 PHYSICAL RECORD: n_fields 3; compact
>> >> >> >> > format;
>> >> >> >> > info
>> >> >> >> > bits 0
>> >> >> >> >  0: len 30; hex
>> >> >> >> > 383834464239393738383632333242323331353041343031303337424444;
>> >> >> >> > asc
>> >> >> >> > 884FB997886232B23150A401037BDD; (total 40 bytes);
>> >> >> >> >  1: len 8; hex 8000013b2761a596; asc    ;'a  ;;
>> >> >> >> >  2: len 8; hex 8000013b27c4823b; asc    ;'  ;;;
>> >> >> >> >
>> >> >> >> > ...
>> >> >> >> > ...
>> >> >> >> > ...
>> >> >> >> >
>> >> >> >> > *** WE ROLL BACK TRANSACTION (1)
>> >> >> >> > ------------
>> >> >> >> > TRANSACTIONS
>> >> >> >> > ------------
>> >> >> >> > Trx id counter 6ACDF6
>> >> >> >> > Purge done for trx's n:o < 6752D1 undo n:o < 0
>> >> >> >> > History list length 485
>> >> >> >> > LIST OF TRANSACTIONS FOR EACH SESSION:
>> >> >> >> > ---TRANSACTION 0, not started
>> >> >> >> > MySQL thread id 5505, OS thread handle 0x7f7d210ec700, query id
>> >> >> >> > 25071245
>> >> >> >> > localhost root
>> >> >> >> > SHOW ENGINE INNODB STATUS
>> >> >> >> > ---TRANSACTION 6ACDF5, not started
>> >> >> >> > MySQL thread id 99, OS thread handle 0x7f7d4a356700, query id
>> >> >> >> > 25071244
>> >> >> >> > localhost 127.0.0.1 manifoldcf
>> >> >> >> > ...
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > ------------------------------------------------------------------------------
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > Regards,
>> >> >> >> >
>> >> >> >> > Shigeki
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > 2012/10/19 Shigeki Kobayashi
>> >> >> >> > <shigeki.kobayashi3@g.softbank.co.jp>
>> >> >> >> >>
>> >> >> >> >> Due to the error, I had to downgrade to a lower version so I
>> >> >> >> >> haven't
>> >> >> >> >> found
>> >> >> >> >> the MySQL error code yet.
>> >> >> >> >>
>> >> >> >> >> I installed MCF1.0 in a different environment where crawlable
>> >> >> >> >> contents
>> >> >> >> >> are
>> >> >> >> >> different from the above environment.
>> >> >> >> >> I could not reproduce the Database exception but socket
>> >> >> >> >> timeout
>> >> >> >> >> occurred
>> >> >> >> >> In the same environment, I ran MCF0.6 and it completed
>> >> >> >> >> crawling
>> >> >> >> >> without
>> >> >> >> >> socket timeout.
>> >> >> >> >> Like you said, socket timeout seems to be a different problem
>> >> >> >> >> from
>> >> >> >> >> the
>> >> >> >> >> Database exception .
>> >> >> >> >>
>> >> >> >> >> 2012/10/18 Karl Wright <daddywri@gmail.com>
>> >> >> >> >>>
>> >> >> >> >>> So, what was the resolution of this problem?  Any news?
>> >> >> >> >>> Karl
>> >> >> >> >>>
>> >> >> >> >>> On Thu, Oct 11, 2012 at 2:28 AM, Karl Wright
>> >> >> >> >>> <daddywri@gmail.com>
>> >> >> >> >>> wrote:
>> >> >> >> >>> > The only change is that the MySQL driver now performs
>> >> >> >> >>> > ANALYZE
>> >> >> >> >>> > operations on the fly in order to keep the database
>> >> >> >> >>> > operating
>> >> >> >> >>> > at
>> >> >> >> >>> > high
>> >> >> >> >>> > efficiency.  This is CONNECTORS-510.  It is possible that,
>> >> >> >> >>> > on
>> >> >> >> >>> > a
>> >> >> >> >>> > large
>> >> >> >> >>> > database table, these operations will cause others to wait
>> >> >> >> >>> > long
>> >> >> >> >>> > enough
>> >> >> >> >>> > so that their timeout is exceeded.  Such an event does not
>> >> >> >> >>> > take
>> >> >> >> >>> > place
>> >> >> >> >>> > while the load tests run, however.  If you want to turn off
>> >> >> >> >>> > the
>> >> >> >> >>> > analyze operation, you can do that by setting a per-table
>> >> >> >> >>> > property
>> >> >> >> >>> > to
>> >> >> >> >>> > override the analyze default of 10000 operations:
>> >> >> >> >>> >
>> >> >> >> >>> > analyzeThreshold =
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> > ManifoldCF.getIntProperty("org.apache.manifold.db.mysql.analyze."+tableName,10000);
>> >> >> >> >>> >
>> >> >> >> >>> > The table in question is "jobqueue".  If you set this value
>> >> >> >> >>> > to
>> >> >> >> >>> > something like 1000000000 and you still see MySQL timeouts,
>> >> >> >> >>> > then
>> >> >> >> >>> > this
>> >> >> >> >>> > new code is not the problem.  And, like I said, the best
>> >> >> >> >>> > solution
>> >> >> >> >>> > is
>> >> >> >> >>> > to recognize the error and retry, but first I would need
>> >> >> >> >>> > the
>> >> >> >> >>> > error
>> >> >> >> >>> > code.  Adding an appropriate output of sqlState around line
>> >> >> >> >>> > 123
>> >> >> >> >>> > of
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> > framework/core/src/main/java/org/apache/manifoldcf/core/database/DBInterfaceMySQL.java
>> >> >> >> >>> > would allow us to see what code to catch, when it happened
>> >> >> >> >>> > again.
>> >> >> >> >>> >
>> >> >> >> >>> > For the Web connector, the only modifications have been in
>> >> >> >> >>> > regards
>> >> >> >> >>> > to
>> >> >> >> >>> > how it handles 500 errors, which now correctly code to
>> >> >> >> >>> > avoid
>> >> >> >> >>> > an
>> >> >> >> >>> > IndexExceptionOutOfBounds exception.  This has nothing to
>> >> >> >> >>> > do
>> >> >> >> >>> > with
>> >> >> >> >>> > socket exceptions, which are caused for external reasons
>> >> >> >> >>> > only.
>> >> >> >> >>> >
>> >> >> >> >>> > Karl
>> >> >> >> >>> >
>> >> >> >> >>> >
>> >> >> >> >>> > On Wed, Oct 10, 2012 at 10:32 PM, Shigeki Kobayashi
>> >> >> >> >>> > <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> >> >> >> >>> >> Hi Karl,
>> >> >> >> >>> >>
>> >> >> >> >>> >>
>> >> >> >> >>> >> I was comparing version 1.0 with old trunk based on
>> >> >> >> >>> >> version
>> >> >> >> >>> >> 0.6
>> >> >> >> >>> >> implementing
>> >> >> >> >>> >> CONNECTORS-501(
>> >> >> >> >>> >> Medium-scale web crawl with hopcount-based filtering fails
>> >> >> >> >>> >> to
>> >> >> >> >>> >> find
>> >> >> >> >>> >> correct
>> >> >> >> >>> >> number of documents).
>> >> >> >> >>> >>
>> >> >> >> >>> >> Running each version with the same MySQL setting and the
>> >> >> >> >>> >> same
>> >> >> >> >>> >> throttling,
>> >> >> >> >>> >> somehow the version 1.0 hangs with the error.
>> >> >> >> >>> >> Since the old trunk completes crawling, I wonder if
>> >> >> >> >>> >> something
>> >> >> >> >>> >> has
>> >> >> >> >>> >> changed.
>> >> >> >> >>> >>
>> >> >> >> >>> >> Just to make sure I will recheck if there are any wrong
>> >> >> >> >>> >> settings
>> >> >> >> >>> >> in
>> >> >> >> >>> >> MCF.
>> >> >> >> >>> >>
>> >> >> >> >>> >> Thanks.
>> >> >> >> >>> >>
>> >> >> >> >>> >> Regards,
>> >> >> >> >>> >>
>> >> >> >> >>> >> Shigeki
>> >> >> >> >>> >>
>> >> >> >> >>> >> 2012/10/10 Karl Wright <daddywri@gmail.com>
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> Hi Shigeki,
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> The socket timeout exception is only a warning.  It means
>> >> >> >> >>> >>> that
>> >> >> >> >>> >>> some
>> >> >> >> >>> >>> site you are crawling did not accept a socket connection
>> >> >> >> >>> >>> within
>> >> >> >> >>> >>> the
>> >> >> >> >>> >>> allowed time (5 minutes I think).  The Web Connector will
>> >> >> >> >>> >>> retry
>> >> >> >> >>> >>> the
>> >> >> >> >>> >>> connection a few times, and if it is still rejected, it
>> >> >> >> >>> >>> will
>> >> >> >> >>> >>> eventually give up on that page.  One thing you want to
>> >> >> >> >>> >>> check,
>> >> >> >> >>> >>> though,
>> >> >> >> >>> >>> is that you are using proper throttling, because if you
>> >> >> >> >>> >>> aren't
>> >> >> >> >>> >>> then
>> >> >> >> >>> >>> one cause of this problem is that the webmaster of the
>> >> >> >> >>> >>> site
>> >> >> >> >>> >>> you
>> >> >> >> >>> >>> are
>> >> >> >> >>> >>> trying to crawl may have blocked you from accessing it.
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> The database exception is more problematic.  It means
>> >> >> >> >>> >>> that
>> >> >> >> >>> >>> MySQL
>> >> >> >> >>> >>> thinks it took too long for a specific transaction to
>> >> >> >> >>> >>> complete,
>> >> >> >> >>> >>> and
>> >> >> >> >>> >>> the database aborted the transaction due to a timeout.
>> >> >> >> >>> >>> There
>> >> >> >> >>> >>> are
>> >> >> >> >>> >>> two
>> >> >> >> >>> >>> ways of dealing with this issue.  One way is to modify
>> >> >> >> >>> >>> your
>> >> >> >> >>> >>> MySQL
>> >> >> >> >>> >>> configuration to increase the transaction timeout value
>> >> >> >> >>> >>> to
>> >> >> >> >>> >>> some
>> >> >> >> >>> >>> high
>> >> >> >> >>> >>> number.  The second way is to modify ManifoldCF to
>> >> >> >> >>> >>> recognize
>> >> >> >> >>> >>> the
>> >> >> >> >>> >>> timeout error specifically, and cause a retry.  But in
>> >> >> >> >>> >>> order
>> >> >> >> >>> >>> to
>> >> >> >> >>> >>> do
>> >> >> >> >>> >>> the
>> >> >> >> >>> >>> latter, I would need to know what SQL error code MySQL
>> >> >> >> >>> >>> returns
>> >> >> >> >>> >>> for
>> >> >> >> >>> >>> this situation, which will mean we either need to look it
>> >> >> >> >>> >>> up
>> >> >> >> >>> >>> (if
>> >> >> >> >>> >>> we
>> >> >> >> >>> >>> can), or modify a ManifoldCF instance to log it when this
>> >> >> >> >>> >>> problem
>> >> >> >> >>> >>> occurs.
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> Please let me know how you would like to proceed.
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> Karl
>> >> >> >> >>> >>>
>> >> >> >> >>> >>> On Wed, Oct 10, 2012 at 3:51 AM, Shigeki Kobayashi
>> >> >> >> >>> >>> <shigeki.kobayashi3@g.softbank.co.jp> wrote:
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > Hi
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > I am having a trouble with crawling web using MCF1.0.
>> >> >> >> >>> >>> > I run MCF with MySQL 5.5 and Tomcat 6.0.
>> >> >> >> >>> >>> > It should keep crawling contents, but MCF prints the
>> >> >> >> >>> >>> > following
>> >> >> >> >>> >>> > Database
>> >> >> >> >>> >>> > exception log, then hangs.
>> >> >> >> >>> >>> > After DB Exception, Socket Time Exception occurs.
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > Anyone has faced this problem?
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > --Database Exception log:
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > ERROR 2012-10-10 16:11:05,787 (Worker thread '42') -
>> >> >> >> >>> >>> > Worker
>> >> >> >> >>> >>> > thread
>> >> >> >> >>> >>> > aborting
>> >> >> >> >>> >>> > and restarting due to database connection reset:
>> >> >> >> >>> >>> > Database
>> >> >> >> >>> >>> > exception:
>> >> >> >> >>> >>> > Exception doing query: Lock wait timeout exceeded; try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> >> >>> >>> > Database
>> >> >> >> >>> >>> > exception: Exception doing query: Lock wait timeout
>> >> >> >> >>> >>> > exceeded;
>> >> >> >> >>> >>> > try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performQuery(DBInterfaceMySQL.java:852)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.jobs.JobManager.addDocuments(JobManager.java:4089)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.processDocumentReferences(WorkerThread.java:1932)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.addDocumentReference(WorkerThread.java:1487)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector$ProcessActivityLinkHandler.noteDiscoveredLink(WebcrawlerConnector.java:6049)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector$ProcessAcivityHTMLHandler.noteAHREF(WebcrawlerConnector.java:6159)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.LinkParseState.noteNonscriptTag(LinkParseState.java:44)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.FormParseState.noteNonscriptTag(FormParseState.java:52)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.ScriptParseState.noteTag(ScriptParseState.java:50)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.BasicParseState.dealWithCharacter(BasicParseState.java:225)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.handleHTML(WebcrawlerConnector.java:7047)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.extractLinks(WebcrawlerConnector.java:6011)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.processDocuments(WebcrawlerConnector.java:1282)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:551)
>> >> >> >> >>> >>> > Caused by: java.sql.SQLException: Lock wait timeout
>> >> >> >> >>> >>> > exceeded;
>> >> >> >> >>> >>> > try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.PreparedStatement.executeQuery(PreparedStatement.java:2293)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.execute(Database.java:826)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> >> >>> >>> > ERROR 2012-10-10 16:11:06,799 (Worker thread '9') -
>> >> >> >> >>> >>> > Worker
>> >> >> >> >>> >>> > thread
>> >> >> >> >>> >>> > aborting
>> >> >> >> >>> >>> > and restarting due to database connection reset:
>> >> >> >> >>> >>> > Database
>> >> >> >> >>> >>> > exception:
>> >> >> >> >>> >>> > Exception doing query: Lock wait timeout exceeded; try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> >> >>> >>> > Database
>> >> >> >> >>> >>> > exception: Exception doing query: Lock wait timeout
>> >> >> >> >>> >>> > exceeded;
>> >> >> >> >>> >>> > try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeViaThread(Database.java:681)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeUncachedQuery(Database.java:709)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database$QueryCacheExecutor.create(Database.java:1394)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.cachemanager.CacheManager.findObjectsAndExecute(CacheManager.java:144)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.executeQuery(Database.java:186)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.DBInterfaceMySQL.performQuery(DBInterfaceMySQL.java:852)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.jobs.JobManager.addDocuments(JobManager.java:4089)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.processDocumentReferences(WorkerThread.java:1932)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.flush(WorkerThread.java:1863)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:554)
>> >> >> >> >>> >>> > Caused by: java.sql.SQLException: Lock wait timeout
>> >> >> >> >>> >>> > exceeded;
>> >> >> >> >>> >>> > try
>> >> >> >> >>> >>> > restarting
>> >> >> >> >>> >>> > transaction
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.SQLError.createSQLException(SQLError.java:1073)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3609)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.checkErrorPacket(MysqlIO.java:3541)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.sendCommand(MysqlIO.java:2002)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.MysqlIO.sqlQueryDirect(MysqlIO.java:2163)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.ConnectionImpl.execSQL(ConnectionImpl.java:2624)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.PreparedStatement.executeInternal(PreparedStatement.java:2127)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > com.mysql.jdbc.PreparedStatement.executeQuery(PreparedStatement.java:2293)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database.execute(Database.java:826)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.database.Database$ExecuteQueryThread.run(Database.java:641)
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > ---- Socket Timeout:
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > DEBUG 2012-10-10 16:16:27,256 (Worker thread '49') -
>> >> >> >> >>> >>> > Socket
>> >> >> >> >>> >>> > timeout
>> >> >> >> >>> >>> > exception trying to close connection: Read timed out
>> >> >> >> >>> >>> > java.net.SocketTimeoutException: Read timed out
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> > java.net.SocketInputStream.socketRead0(Native
>> >> >> >> >>> >>> > Method)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.net.SocketInputStream.read(SocketInputStream.java:129)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.BufferedInputStream.read1(BufferedInputStream.java:258)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.BufferedInputStream.read(BufferedInputStream.java:317)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.ContentLengthInputStream.read(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.ContentLengthInputStream.read(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.ChunkedInputStream.exhaustInputStream(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.ContentLengthInputStream.close(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.FilterInputStream.close(FilterInputStream.java:155)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.AutoCloseInputStream.notifyWatcher(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.AutoCloseInputStream.close(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.ThrottledFetcher$ThrottledInputstream.close(ThrottledFetcher.java:2082)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.DataCache.addData(DataCache.java:176)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.getDocumentVersions(WebcrawlerConnector.java:745)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:321)
>> >> >> >> >>> >>> >  INFO 2012-10-10 16:16:27,273 (Worker thread '49') -
>> >> >> >> >>> >>> > WEB:
>> >> >> >> >>> >>> > FETCH
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > URL|http://xxxxxx/...|1349852786744+600514|-104|4125|org.apache.manifoldcf.core.interfaces.ManifoldCFException|
>> >> >> >> >>> >>> > Interrupted: Socket timeout: Read timed out
>> >> >> >> >>> >>> > DEBUG 2012-10-10 16:16:27,273 (Worker thread '49') -
>> >> >> >> >>> >>> > WEB:
>> >> >> >> >>> >>> > Fetch
>> >> >> >> >>> >>> > exception
>> >> >> >> >>> >>> > for 'http://xxxxxx/...'
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.core.interfaces.ManifoldCFException:
>> >> >> >> >>> >>> > Interrupted:
>> >> >> >> >>> >>> > Socket timeout: Read timed out
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.ThrottledFetcher$ThrottledConnection.noteInterrupted(ThrottledFetcher.java:1818)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.getDocumentVersions(WebcrawlerConnector.java:797)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:321)
>> >> >> >> >>> >>> > Caused by:
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.agents.interfaces.ServiceInterruption:
>> >> >> >> >>> >>> > Socket timeout: Read timed out
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.DataCache.addData(DataCache.java:101)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.getDocumentVersions(WebcrawlerConnector.java:745)
>> >> >> >> >>> >>> >         ... 1 more
>> >> >> >> >>> >>> > Caused by: java.net.SocketTimeoutException: Read timed
>> >> >> >> >>> >>> > out
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> > java.net.SocketInputStream.socketRead0(Native
>> >> >> >> >>> >>> > Method)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.net.SocketInputStream.read(SocketInputStream.java:129)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.BufferedInputStream.read1(BufferedInputStream.java:256)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.BufferedInputStream.read(BufferedInputStream.java:317)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.ContentLengthInputStream.read(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > java.io.FilterInputStream.read(FilterInputStream.java:116)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.commons.httpclient.AutoCloseInputStream.read(Unknown
>> >> >> >> >>> >>> > Source)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.ThrottledFetcher$ThrottledInputstream.basicRead(ThrottledFetcher.java:2012)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.ThrottledFetcher$ThrottledInputstream.read(ThrottledFetcher.java:1976)
>> >> >> >> >>> >>> >         at
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > org.apache.manifoldcf.crawler.connectors.webcrawler.DataCache.addData(DataCache.java:95)
>> >> >> >> >>> >>> >         ... 2 more
>> >> >> >> >>> >>> >  WARN 2012-10-10 16:16:27,274 (Worker thread '49') -
>> >> >> >> >>> >>> > Pre-ingest
>> >> >> >> >>> >>> > service
>> >> >> >> >>> >>> > interruption reported for job 1349774325961 connection
>> >> >> >> >>> >>> > 'WEB':
>> >> >> >> >>> >>> > Socket
>> >> >> >> >>> >>> > timeout: Read timed out
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > Regards,
>> >> >> >> >>> >>> >
>> >> >> >> >>> >>> > Shigeki
>> >> >> >> >>> >>
>> >> >> >> >>> >>
>> >> >> >> >>> >>
>> >> >> >> >>> >>
>> >> >> >> >>
>> >> >> >> >>
>> >> >> >> >>
>> >> >> >> >>
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >
>> >> >
>> >> >
>> >
>> >
>> >
>> >
>
>
>
Mime
View raw message