flink-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (FLINK-8431) Allow to specify # GPUs for TaskManager in Mesos
Date Tue, 30 Jan 2018 08:46:00 GMT

    [ https://issues.apache.org/jira/browse/FLINK-8431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16344696#comment-16344696 ]

ASF GitHub Bot commented on FLINK-8431:
---------------------------------------

Github user tillrohrmann commented on a diff in the pull request:

    https://github.com/apache/flink/pull/5307#discussion_r164672220
  
    --- Diff: flink-mesos/src/main/java/org/apache/flink/mesos/runtime/clusterframework/MesosTaskManagerParameters.java ---
    @@ -238,6 +254,12 @@ public static MesosTaskManagerParameters create(Configuration flinkConfig) {
     			cpus = Math.max(containeredParameters.numSlots(), 1.0);
     		}
     
    +		double gpus = Math.floor(flinkConfig.getDouble(MESOS_RM_TASKS_GPUS, 0.0));
    --- End diff --
    
    From the user's perspective, I think it is clearer to make this option an integer. That way
    we don't confuse users who haven't read the code and who infer from the option's double type
    that they can configure a fraction of a GPU. Once Mesos accepts fractional values, we can
    change it in Flink as well. I will apply the change while merging the PR.
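
    To illustrate the suggestion above, here is a minimal sketch of reading the GPU count as a
    whole number and rejecting fractional input, rather than silently flooring a double. It is
    not the code that was merged. The class GpuConfigSketch and the method readGpus are
    hypothetical, and a plain Map stands in for Flink's Configuration object as an assumption
    to keep the example self-contained. Only the key name comes from the issue description.

```java
import java.util.HashMap;
import java.util.Map;

public class GpuConfigSketch {
    // Key name as proposed in the issue description.
    static final String GPUS_KEY = "mesos.resourcemanager.tasks.gpus";

    /**
     * Reads the number of GPUs per TaskManager as a whole number.
     * A missing key means 0 GPUs. The Map is a stand-in (assumption)
     * for Flink's Configuration.
     */
    static int readGpus(Map<String, String> config) {
        String raw = config.get(GPUS_KEY);
        if (raw == null) {
            return 0;
        }
        int gpus;
        try {
            // Integer.parseInt rejects values such as "0.5" or "1.0".
            gpus = Integer.parseInt(raw.trim());
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException(
                GPUS_KEY + " must be a whole number, got: " + raw, e);
        }
        if (gpus < 0) {
            throw new IllegalArgumentException(
                GPUS_KEY + " must not be negative, got: " + raw);
        }
        return gpus;
    }

    public static void main(String[] args) {
        Map<String, String> config = new HashMap<>();
        config.put(GPUS_KEY, "2");
        System.out.println("GPUs per TaskManager: " + readGpus(config));
    }
}
```

    With this shape, a value such as "0.5" fails fast with a clear message instead of being
    floored to 0, which is the kind of user confusion the comment is trying to avoid.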


> Allow to specify # GPUs for TaskManager in Mesos
> ------------------------------------------------
>
>                 Key: FLINK-8431
>                 URL: https://issues.apache.org/jira/browse/FLINK-8431
>             Project: Flink
>          Issue Type: Improvement
>          Components: Cluster Management, Mesos
>            Reporter: Dongwon Kim
>            Assignee: Dongwon Kim
>            Priority: Minor
>
> Mesos provides first-class support for Nvidia GPUs [1], but Flink does not exploit it when scheduling TaskManagers. If Mesos agents are configured to isolate GPUs as shown in [2], TaskManagers that do not request GPUs cannot see any GPUs at all.
> We therefore need to introduce a new configuration property named "mesos.resourcemanager.tasks.gpus" to allow users to specify the number of GPUs for each TaskManager process in Mesos.
> [1] http://mesos.apache.org/documentation/latest/gpu-support/
> [2] http://mesos.apache.org/documentation/latest/gpu-support/#agent-flags



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
