From dev-return-3612-apmail-tika-dev-archive=tika.apache.org@tika.apache.org Wed Jul 07 15:36:45 2010
Return-Path:
Delivered-To: apmail-tika-dev-archive@www.apache.org
Received: (qmail 33673 invoked from network); 7 Jul 2010 15:36:44 -0000
Received: from unknown (HELO mail.apache.org) (140.211.11.3)
by 140.211.11.9 with SMTP; 7 Jul 2010 15:36:44 -0000
Received: (qmail 45471 invoked by uid 500); 7 Jul 2010 15:36:44 -0000
Delivered-To: apmail-tika-dev-archive@tika.apache.org
Received: (qmail 45355 invoked by uid 500); 7 Jul 2010 15:36:44 -0000
Mailing-List: contact dev-help@tika.apache.org; run by ezmlm
Precedence: bulk
List-Help:
List-Unsubscribe:
List-Post:
List-Id:
Reply-To: dev@tika.apache.org
Delivered-To: mailing list dev@tika.apache.org
Received: (qmail 45344 invoked by uid 99); 7 Jul 2010 15:36:43 -0000
Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230)
by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Jul 2010 15:36:43 +0000
X-ASF-Spam-Status: No, hits=-2000.0 required=10.0
tests=ALL_TRUSTED
X-Spam-Check-By: apache.org
Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22)
by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Jul 2010 15:36:41 +0000
Received: from thor (localhost [127.0.0.1])
by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o67FSoTO001991
for ; Wed, 7 Jul 2010 15:28:50 GMT
Message-ID: <8617087.240251278516530003.JavaMail.jira@thor>
Date: Wed, 7 Jul 2010 11:28:50 -0400 (EDT)
From: "Julien Nioche (JIRA)"
To: dev@tika.apache.org
Subject: [jira] Created: (TIKA-457) HTMLParser gets an early
event
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394
X-Virus-Checked: Checked by ClamAV on apache.org
HTMLParser gets an early event
--------------------------------------
Key: TIKA-457
URL: https://issues.apache.org/jira/browse/TIKA-457
Project: Tika
Issue Type: Bug
Components: parser
Reporter: Julien Nioche
I am using the IdentityMapper in the HTMLparser with this simple document:
{code}