spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hyukjin Kwon (Jira)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-32760) Support for INET data type
Date Wed, 02 Sep 2020 01:29:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-32760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17188902#comment-17188902
] 

Hyukjin Kwon commented on SPARK-32760:
--------------------------------------

The problem is that you should implement the serde for Python and R sides as well to make
it properly supported. This is a huge work. Let's don't do this unless there's a very strong
reason and very wide needs from the community.


> Support for INET data type
> --------------------------
>
>                 Key: SPARK-32760
>                 URL: https://issues.apache.org/jira/browse/SPARK-32760
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Spark Core
>    Affects Versions: 2.4.0, 3.0.0, 3.1.0
>            Reporter: Ruslan Dautkhanov
>            Priority: Major
>
> PostgreSQL has support for `INET` data type 
> [https://www.postgresql.org/docs/9.1/datatype-net-types.html]
> We have a few customers that are interested in similar, native support for IP addresses,
just like in PostgreSQL.
> The issue with storing IP addresses as strings, is that most of the matches (like if
an IP address belong to a subnet) in most cases can't take leverage of parquet bloom filters. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message