From: Upayavira
To: solr-user@lucene.apache.org
Subject: Re: Field Size per document in Solr
Date: Tue, 05 Jan 2016 00:48:00 +0000

Solr does store the term positions, but you won't find it easy to extract
them, as they are stored against terms, not fields.

Your best bet is to index field lengths into Solr alongside the field
values. You could use an UpdateProcessor to do this if you want to do it
in Solr.

Upayavira

On Tue, Jan 5, 2016, at 12:39 AM, KNitin wrote:
> Hi,
>
> I want to get the size of individual fields per document (or per index) in
> SolrCloud. Is there a way to do this using the existing Solr or Lucene API?
>
> *Use case*: I have a few dynamic fields which may or may not be populated
> every day depending on certain conditions. I also do faceting and some
> custom processing on these fields (using custom Solr components). I want to
> be able to plot the per-field size of an index in real time so that I can
> try to identify the trend between fields & latencies.
>
> Thanks a lot in advance!
> Nitin
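
The suggestion in the reply, indexing field lengths alongside the field values, could be sketched on the client side roughly as below. This is only an illustration, not the UpdateProcessor approach itself: the helper names (field_size, add_field_sizes), the "_size_i" suffix, and the example field names are all hypothetical, and it assumes the schema has a dynamic integer field matching "*_i" and that UTF-8 byte length is an acceptable measure of "size".

```python
import json


def field_size(value):
    """Return the size of a field value in UTF-8 bytes.

    Multi-valued fields (lists/tuples) are summed over their values.
    """
    if isinstance(value, (list, tuple)):
        return sum(field_size(v) for v in value)
    return len(str(value).encode("utf-8"))


def add_field_sizes(doc, suffix="_size_i", skip=("id",)):
    """Return a copy of doc with one companion size field per original field.

    The suffix is an assumption: it relies on a dynamic field such as "*_i"
    being defined in the schema. Fields listed in skip, and fields whose
    value is None, get no companion field.
    """
    out = dict(doc)
    for name, value in doc.items():
        if name in skip or value is None:
            continue
        out[name + suffix] = field_size(value)
    return out


if __name__ == "__main__":
    # Hypothetical document; the field names are for illustration only.
    doc = {"id": "doc-1", "tags_ss": ["solr", "lucene"], "body_t": "h\u00e9llo"}
    print(json.dumps(add_field_sizes(doc), ensure_ascii=False))
```

The enriched documents would then be sent to Solr as usual, and the companion fields could be aggregated (for example with the stats component) to plot per-field sizes over time. Doing the same inside Solr would mean moving this logic into a custom UpdateProcessor, as the reply suggests.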