[ 
https://issues.apache.org/jira/browse/SPARK-14300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joseph K. Bradley updated SPARK-14300:
--------------------------------------
    Description: 
Duplicated code that I found in scala/examples/mllib:

* scala/mllib
** DenseGaussianMixture.scala
** StreamingLinearRegression.scala

(This is the updated list.  The original list is copied below.)

h4. Original list of code examples to check

Original list:
* scala/mllib
** DecisionTreeRunner.scala 
** DenseGaussianMixture.scala
** DenseKMeans.scala
** GradientBoostedTreesRunner.scala
** LDAExample.scala
** LinearRegression.scala
** SparseNaiveBayes.scala
** StreamingLinearRegression.scala
** StreamingLogisticRegression.scala
** TallSkinnyPCA.scala
** TallSkinnySVD.scala

* Unsure code duplications (need doube check)
** AbstractParams.scala
** BinaryClassification.scala
** Correlations.scala
** CosineSimilarity.scala
** DenseGaussianMixture.scala
** FPGrowthExample.scala
** MovieLensALS.scala
** MultivariateSummarizer.scala
** RandomRDDGeneration.scala
** SampledRDDs.scala

When merging and cleaning those code, be sure not disturb the previous example 
on and off blocks.

  was:
Duplicated code that I found in scala/examples/mllib:

* scala/mllib
** DecisionTreeRunner.scala 
** DenseGaussianMixture.scala
** DenseKMeans.scala
** GradientBoostedTreesRunner.scala
** LDAExample.scala
** LinearRegression.scala
** SparseNaiveBayes.scala
** StreamingLinearRegression.scala
** StreamingLogisticRegression.scala
** TallSkinnyPCA.scala
** TallSkinnySVD.scala

* Unsure code duplications (need doube check)
** AbstractParams.scala
** BinaryClassification.scala
** Correlations.scala
** CosineSimilarity.scala
** DenseGaussianMixture.scala
** FPGrowthExample.scala
** MovieLensALS.scala
** MultivariateSummarizer.scala
** RandomRDDGeneration.scala
** SampledRDDs.scala

When merging and cleaning those code, be sure not disturb the previous example 
on and off blocks.


> Scala MLlib examples code merge and clean up
> --------------------------------------------
>
>                 Key: SPARK-14300
>                 URL: https://issues.apache.org/jira/browse/SPARK-14300
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Examples
>            Reporter: Xusen Yin
>            Priority: Minor
>              Labels: starter
>
> Duplicated code that I found in scala/examples/mllib:
> * scala/mllib
> ** DenseGaussianMixture.scala
> ** StreamingLinearRegression.scala
> (This is the updated list.  The original list is copied below.)
> h4. Original list of code examples to check
> Original list:
> * scala/mllib
> ** DecisionTreeRunner.scala 
> ** DenseGaussianMixture.scala
> ** DenseKMeans.scala
> ** GradientBoostedTreesRunner.scala
> ** LDAExample.scala
> ** LinearRegression.scala
> ** SparseNaiveBayes.scala
> ** StreamingLinearRegression.scala
> ** StreamingLogisticRegression.scala
> ** TallSkinnyPCA.scala
> ** TallSkinnySVD.scala
> * Unsure code duplications (need doube check)
> ** AbstractParams.scala
> ** BinaryClassification.scala
> ** Correlations.scala
> ** CosineSimilarity.scala
> ** DenseGaussianMixture.scala
> ** FPGrowthExample.scala
> ** MovieLensALS.scala
> ** MultivariateSummarizer.scala
> ** RandomRDDGeneration.scala
> ** SampledRDDs.scala
> When merging and cleaning those code, be sure not disturb the previous 
> example on and off blocks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to