systemml-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Niketan Pansare" <>
Subject Open tasks: Integration with MLPipeline
Date Thu, 03 Dec 2015 22:32:21 GMT

Hi all,

In this email, I list the open tasks related to integration with
MLPipeline. This allows external developers to contribute to the SystemML
project until our JIRA server is up and running.

1. Make the existing Logistic regression wrapper more robust:
- Extend the wrapper or the DML script to handle zero-based labels (either
throw an error or support zero-based labels).

2. Improve the performance of the Logistic regression wrapper:
- Profile the wrapper to find potential bottlenecks. The candidates for
bottlenecks are RDDConverterUtilsExt.vectorDataFrameToBinaryBlock and line
153-158 in LogisticRegressionModel.

3.  Perform detailed performance analysis of the converter utils.
- Also explore the usability aspect of these utils.

4. Add MLPipeline wrappers for existing scripts.
- Refer to
to pick the algorithm and to
understand the assumptions as well as parameters to the given algorithm.
- A good algorithm to start with is L2SVM:

5. Add the documentation for MLPipeline wrappers to

1. Existing Logistic regression wrappers:

2. Converter utils:


Niketan Pansare
IBM Almaden Research Center
E-mail: npansar At

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message