From dev-return-30120-apmail-tika-dev-archive=tika.apache.org@tika.apache.org Mon Jan 7 21:12:03 2019 Return-Path: X-Original-To: apmail-tika-dev-archive@www.apache.org Delivered-To: apmail-tika-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 756ED187FA for ; Mon, 7 Jan 2019 21:12:03 +0000 (UTC) Received: (qmail 6653 invoked by uid 500); 7 Jan 2019 21:12:03 -0000 Delivered-To: apmail-tika-dev-archive@tika.apache.org Received: (qmail 6606 invoked by uid 500); 7 Jan 2019 21:12:03 -0000 Mailing-List: contact dev-help@tika.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@tika.apache.org Delivered-To: mailing list dev@tika.apache.org Received: (qmail 6594 invoked by uid 99); 7 Jan 2019 21:12:03 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Jan 2019 21:12:03 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id E0021C06A0 for ; Mon, 7 Jan 2019 21:12:02 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -110.301 X-Spam-Level: X-Spam-Status: No, score=-110.301 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id wJ7KGwEYQRrp for ; Mon, 7 Jan 2019 21:12:01 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id 64F8460F0A for ; Mon, 7 Jan 2019 21:12:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id B0BD1E2638 for ; Mon, 7 Jan 2019 21:12:00 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 123922554C for ; Mon, 7 Jan 2019 21:12:00 +0000 (UTC) Date: Mon, 7 Jan 2019 21:12:00 +0000 (UTC) From: "Hudson (JIRA)" To: dev@tika.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (TIKA-2809) Add reports for structure tags to tika-eval MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/TIKA-2809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16736355#comment-16736355 ] Hudson commented on TIKA-2809: ------------------------------ SUCCESS: Integrated in Jenkins build tika-branch-1x #153 (See [https://builds.apache.org/job/tika-branch-1x/153/]) TIKA-2809 -- add reports for tags; and add "b" tag. (tallison: [https://github.com/apache/tika/commit/73d009a75bb3806971865868e6bacc75717023a2]) * (edit) tika-eval/src/main/java/org/apache/tika/eval/AbstractProfiler.java * (edit) tika-eval/src/main/java/org/apache/tika/eval/ExtractProfiler.java * (edit) tika-eval/src/main/resources/profile-reports.xml * (edit) tika-eval/src/main/java/org/apache/tika/eval/db/Cols.java * (edit) CHANGES.txt * (edit) tika-eval/src/main/resources/comparison-reports.xml > Add reports for structure tags to tika-eval > ------------------------------------------- > > Key: TIKA-2809 > URL: https://issues.apache.org/jira/browse/TIKA-2809 > Project: Tika > Issue Type: Improvement > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Minor > Fix For: 2.0.0, 1.21 > > > In TIKA-2791, we added the capability to extract/count structure tags, e.g.

, , from xhtml content. Let's add tag reports to the standard reports. -- This message was sent by Atlassian JIRA (v7.6.3#76005)