From user-return-13579-apmail-spark-user-archive=spark.apache.org@spark.apache.org Thu Aug 7 08:30:10 2014 Return-Path: X-Original-To: apmail-spark-user-archive@minotaur.apache.org Delivered-To: apmail-spark-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A8C4911913 for ; Thu, 7 Aug 2014 08:30:10 +0000 (UTC) Received: (qmail 88993 invoked by uid 500); 7 Aug 2014 08:30:09 -0000 Delivered-To: apmail-spark-user-archive@spark.apache.org Received: (qmail 88931 invoked by uid 500); 7 Aug 2014 08:30:09 -0000 Mailing-List: contact user-help@spark.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list user@spark.apache.org Received: (qmail 88921 invoked by uid 99); 7 Aug 2014 08:30:09 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Aug 2014 08:30:09 +0000 X-ASF-Spam-Status: No, hits=2.3 required=5.0 tests=SPF_SOFTFAIL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: softfail (athena.apache.org: transitioning domain of skrishna.id@gmail.com does not designate 216.139.236.26 as permitted sender) Received: from [216.139.236.26] (HELO sam.nabble.com) (216.139.236.26) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Aug 2014 08:30:04 +0000 Received: from ben.nabble.com ([192.168.236.152]) by sam.nabble.com with esmtp (Exim 4.72) (envelope-from ) id 1XFJ4y-0000cR-4A for user@spark.incubator.apache.org; Thu, 07 Aug 2014 01:29:44 -0700 Date: Thu, 7 Aug 2014 01:29:44 -0700 (PDT) From: SK To: user@spark.incubator.apache.org Message-ID: <1407400184119-11632.post@n3.nabble.com> In-Reply-To: References: <1407365109974-11592.post@n3.nabble.com> <1744048717.3937427.1407376860533.JavaMail.zimbra@stanford.edu> <1407397666065-11623.post@n3.nabble.com> Subject: Re: Naive Bayes parameters MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Ok, thanks for clarifying. So looks like numFeatures is only relevant for lib SVM format. I am using LabeledPoint, so if data is not sparse, perhaps numFeatures is not required. I thought that the Params class defines all the parameters passed to the ML algorithm. But it looks like it also includes other options. Just as a suggestion - it may be useful to have a separate class for just the algorithm parameters, so it is clear what can be tuned. thanks -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Naive-Bayes-parameters-tp11592p11632.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscribe@spark.apache.org For additional commands, e-mail: user-help@spark.apache.org