spark-issues mailing list archives

From "Andre Schumacher (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-1487) Support record filtering via predicate pushdown in Parquet
Date Sat, 24 May 2014 19:39:01 GMT

     [ https://issues.apache.org/jira/browse/SPARK-1487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andre Schumacher updated SPARK-1487:
------------------------------------

    Affects Version/s:     (was: 1.0.0)
                       1.1.0

> Support record filtering via predicate pushdown in Parquet
> ----------------------------------------------------------
>
>                 Key: SPARK-1487
>                 URL: https://issues.apache.org/jira/browse/SPARK-1487
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 1.1.0
>            Reporter: Andre Schumacher
>            Assignee: Andre Schumacher
>             Fix For: 1.1.0
>
>
> Parquet supports column filters, which can be used to avoid reading and deserializing
records that fail the filter condition. This can yield large savings, depending on how
many columns are filtered on and how many records actually pass the filter.
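
To illustrate where the savings described above come from, here is a minimal sketch in plain Python. It is a toy model, not Parquet's or Spark's actual implementation: the JSON-encoded column layout and the names `write_columns` and `scan_with_pushdown` are assumptions made up for this example. The predicate is evaluated on the filtered column alone, and the remaining columns of a record are decoded only if the record passes.

```python
import json


def write_columns(records, names):
    # Hypothetical layout: each column is stored separately as JSON strings,
    # standing in for encoded column chunks.
    return {name: [json.dumps(rec[name]) for rec in records] for name in names}


def scan_with_pushdown(columns, filter_column, predicate):
    """Return (matching rows, number of values decoded)."""
    decoded = 0
    rows = []
    for i, raw in enumerate(columns[filter_column]):
        value = json.loads(raw)  # decode only the filtered column first
        decoded += 1
        if not predicate(value):
            continue  # record fails the filter: its other columns are never decoded
        row = {filter_column: value}
        for name, column in columns.items():
            if name != filter_column:
                row[name] = json.loads(column[i])
                decoded += 1
        rows.append(row)
    return rows, decoded


records = [{"id": i, "name": f"user{i}", "age": 20 + i} for i in range(10)]
columns = write_columns(records, ["id", "name", "age"])
rows, decoded = scan_with_pushdown(columns, "age", lambda age: age >= 28)
print(len(rows), decoded)  # prints "2 14"
```

In this sketch a scan without the filter would decode all 30 values (10 records, 3 columns), while the filtered scan decodes 14: the 10 values of `age` plus the two other columns for each of the 2 matching records. The fewer records pass, the larger the saving.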



--
This message was sent by Atlassian JIRA
(v6.2#6252)
