[
https://issues.apache.org/jira/browse/IGNITE-8795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16515621#comment-16515621
]
ASF GitHub Bot commented on IGNITE-8795:
----------------------------------------
GitHub user dmitrievanthony opened a pull request:
https://github.com/apache/ignite/pull/4214
IGNITE-8795 Add ability to start and maintain TensorFlow cluster on top of
Apache Ignite
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gridgain/apache-ignite ignite-8795
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/ignite/pull/4214.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #4214
----
commit 9a8d733f19a06aea210f8da233d28fb859570819
Author: dmitrievanthony <dmitrievanthony@...>
Date: 2018-06-16T18:00:39Z
IGNITE-8795 First prototype of TensorFlow Cluster manager.
commit 3becae8217a8556f8f985562fb732cd081c71673
Author: Anton Dmitriev <dmitrievanthony@...>
Date: 2018-06-18T11:47:43Z
IGNITE-8795 Add tests to TensorFlow module and refactoring.
commit 5f6ee37ebdc1f9cd852d70890ea45b63c15096c1
Author: Anton Dmitriev <dmitrievanthony@...>
Date: 2018-06-18T11:50:01Z
IGNITE-8795 Remove commented code.
----
> Add ability to start and maintain TensorFlow cluster on top of Apache Ignite
> ----------------------------------------------------------------------------
>
> Key: IGNITE-8795
> URL: https://issues.apache.org/jira/browse/IGNITE-8795
> Project: Ignite
> Issue Type: New Feature
> Components: ml
> Reporter: Yury Babak
> Assignee: Anton Dmitriev
> Priority: Major
> Fix For: 2.6
>
>
> As described in the [design
> document|https://docs.google.com/document/d/1jROIahK1rc7bSgOvhJhfpMqIGvht_IE8zn5NAt6x8ks/edit?usp=sharing],
> Distributed TensorFlow is based on TensorFlow cluster concept. It's a set of
> TensorFlow processes started among the cluster and available througth the
> gRPC interfaces. It's assumed that these processes contain heavy operations
> that requires data to be stored locally on the nodes where the processes
> running. Apache Ignite admits the data to be moved from one node to another
> as result of node failure of rebalancing. As result the TensorFlow cluster
> should be changed dynamically as well as TensorFlow Cache (follow-the-data
> strategy).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)