systemml-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From GitBox <...@apache.org>
Subject [GitHub] [systemml] Baunsgaard opened a new pull request #909: [SYSTEMDS-???] [WIP] Distinct values estimation function
Date Mon, 11 May 2020 09:03:23 GMT

Baunsgaard opened a new pull request #909:
URL: https://github.com/apache/systemml/pull/909


   New function `unique_length` for counting the number of distinct values.
   
   ```R
   count = unique_length(X, type="count")
   ```
   
   can be called to execute with different types of estimators:
   
   - count : The default implementation that counts by adding to an HashMap.
     Not memory efficient, but returns exact counts.
   - KMV : An estimation algorithm K Minimum Values
   - HLL : An estimation algorithm Hyper Log Log (Not finished)


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



Mime
View raw message