From user-return-13506-apmail-mahout-user-archive=mahout.apache.org@mahout.apache.org Sat May 19 00:09:21 2012 Return-Path: X-Original-To: apmail-mahout-user-archive@www.apache.org Delivered-To: apmail-mahout-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B01B99FA8 for ; Sat, 19 May 2012 00:09:21 +0000 (UTC) Received: (qmail 39345 invoked by uid 500); 19 May 2012 00:09:20 -0000 Delivered-To: apmail-mahout-user-archive@mahout.apache.org Received: (qmail 39277 invoked by uid 500); 19 May 2012 00:09:20 -0000 Mailing-List: contact user-help@mahout.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@mahout.apache.org Delivered-To: mailing list user@mahout.apache.org Received: (qmail 39266 invoked by uid 99); 19 May 2012 00:09:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 19 May 2012 00:09:20 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ted.dunning@gmail.com designates 209.85.214.170 as permitted sender) Received: from [209.85.214.170] (HELO mail-ob0-f170.google.com) (209.85.214.170) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 19 May 2012 00:09:14 +0000 Received: by obbuo13 with SMTP id uo13so9994512obb.1 for ; Fri, 18 May 2012 17:08:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=i987keL6FXb6DUtNIL7hFJ3Ney9JmZ/2ZNF0WhS2w8Q=; b=pZ4N9OEOyZvAgHe8WGhG6Cw8L9JRXemTFyYLqm8wBUvwgt9uDEE/ODIwWLYwAyTBoS qjsHHpnw7u8jxztwlMlS+1UWEsc/hrf55VLl6F6FrRxOZRE2DCbzQliVAvaagtIjUUnJ r39qaz6jYXtGen57OjEIkAU7b49m8heasiAms+ps3EyTT/kfaX1n+FB7IoYKMyKVD0Wi WtRil0tlUPMaTLG7eTza+yjhzpTxXkfFgUeuWPQXyFzKrUtfjkFqb1Qj1BWi4rCreesA CK5JHh17u8ofFYDRUL7x7C+Wpl9JNcqfDJ7U+EneT+vu/XMTazaUgmeiYeE8Xg3TtjwJ +PIg== Received: by 10.182.12.6 with SMTP id u6mr12558870obb.12.1337386134204; Fri, 18 May 2012 17:08:54 -0700 (PDT) MIME-Version: 1.0 Received: by 10.182.40.101 with HTTP; Fri, 18 May 2012 17:08:24 -0700 (PDT) In-Reply-To: References: <1337374825242-3984795.post@n3.nabble.com> From: Ted Dunning Date: Fri, 18 May 2012 20:08:24 -0400 Message-ID: Subject: Re: How to approach this? Classification vs Recommendation To: user@mahout.apache.org Content-Type: multipart/alternative; boundary=f46d0444edd374efa404c05879e0 X-Virus-Checked: Checked by ClamAV on apache.org --f46d0444edd374efa404c05879e0 Content-Type: text/plain; charset=UTF-8 Not so trivially, these classifiers can help each other. What you have is a form of transduction or example based learnng. On Fri, May 18, 2012 at 5:24 PM, Sean Owen wrote: > Trivially it's four classifiers. You have just one input here, and > it's binary. That seems like too little info to discriminate on. All > you can learn -- and it doesn't really need a classifier algorithm -- > is there's an x% chance of encountering problem a if funded, and > (100-x)% of a if not. > > On Fri, May 18, 2012 at 10:00 PM, fht wrote: > > Hi, > > > > I suppose this a combination of a generic machine learning question and a > > mahout question. > > > > I have a data set. A user may or may not be part of a funded scheme. > > > > If there are not part of the funded scheme they might be susceptible to > > certain problems a, b, c and d. > > If there are part of the funded scheme they might incur problems a, b > and c > > but not d. > > > > I want to process the data set to infer something like people who *are* > part > > of funded scheme won't encounter problem c and d. > > > > Is this a recommendation or a classification - How do I approach this? > > > > Also can hive inteactt with mahout - I read (correct me if I'm wrong) > that > > it's probably best to input data to mahout in csv format - I assume this > is > > possible with hive? > > > > many thanks. > > > > -- > > View this message in context: > http://lucene.472066.n3.nabble.com/How-to-approach-this-Classification-vs-Recommendation-tp3984795.html > > Sent from the Mahout User List mailing list archive at Nabble.com. > --f46d0444edd374efa404c05879e0--