[ 
https://issues.apache.org/jira/browse/IGNITE-8795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16515621#comment-16515621
 ] 

ASF GitHub Bot commented on IGNITE-8795:
----------------------------------------

GitHub user dmitrievanthony opened a pull request:

    https://github.com/apache/ignite/pull/4214

    IGNITE-8795 Add ability to start and maintain TensorFlow cluster on top of 
Apache Ignite

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gridgain/apache-ignite ignite-8795

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/4214.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4214
    
----
commit 9a8d733f19a06aea210f8da233d28fb859570819
Author: dmitrievanthony <dmitrievanthony@...>
Date:   2018-06-16T18:00:39Z

    IGNITE-8795 First prototype of TensorFlow Cluster manager.

commit 3becae8217a8556f8f985562fb732cd081c71673
Author: Anton Dmitriev <dmitrievanthony@...>
Date:   2018-06-18T11:47:43Z

    IGNITE-8795 Add tests to TensorFlow module and refactoring.

commit 5f6ee37ebdc1f9cd852d70890ea45b63c15096c1
Author: Anton Dmitriev <dmitrievanthony@...>
Date:   2018-06-18T11:50:01Z

    IGNITE-8795 Remove commented code.

----


> Add ability to start and maintain TensorFlow cluster on top of Apache Ignite
> ----------------------------------------------------------------------------
>
>                 Key: IGNITE-8795
>                 URL: https://issues.apache.org/jira/browse/IGNITE-8795
>             Project: Ignite
>          Issue Type: New Feature
>          Components: ml
>            Reporter: Yury Babak
>            Assignee: Anton Dmitriev
>            Priority: Major
>             Fix For: 2.6
>
>
> As described in the [design 
> document|https://docs.google.com/document/d/1jROIahK1rc7bSgOvhJhfpMqIGvht_IE8zn5NAt6x8ks/edit?usp=sharing],
>  Distributed TensorFlow is based on TensorFlow cluster concept. It's a set of 
> TensorFlow processes started among the cluster and available througth the 
> gRPC interfaces. It's assumed that these processes contain heavy operations 
> that requires data to be stored locally on the nodes where the processes 
> running. Apache Ignite admits the data to be moved from one node to another 
> as result of node failure of rebalancing. As result the TensorFlow cluster 
> should be changed dynamically as well as TensorFlow Cache (follow-the-data 
> strategy).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to