[ https://issues.apache.org/jira/browse/SQOOP-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14274759#comment-14274759 ]
Hudson commented on SQOOP-1936:
-------------------------------
SUCCESS: Integrated in Sqoop2-hadoop200 #703 (See
[https://builds.apache.org/job/Sqoop2-hadoop200/703/])
SQOOP-1936: Sqoop2: Sort by comparing IDF data in shuffle phase (abe:
https://git-wip-us.apache.org/repos/asf?p=sqoop.git&a=commit&h=7631d293335855fd42a319c1fcb56530c27f78a4)
* connector/connector-sdk/src/main/java/org/apache/sqoop/connector/idf/CSVIntermediateDataFormat.java
* execution/mapreduce/src/test/java/org/apache/sqoop/job/io/TestSqoopWritable.java
* execution/mapreduce/src/main/java/org/apache/sqoop/job/io/SqoopWritable.java
* connector/connector-sdk/src/main/java/org/apache/sqoop/connector/idf/IntermediateDataFormat.java
* connector/connector-sdk/src/test/java/org/apache/sqoop/connector/idf/TestCSVIntermediateDataFormat.java
* connector/connector-sdk/src/main/java/org/apache/sqoop/connector/idf/JSONIntermediateDataFormat.java
> Sqoop2: Sort by comparing IDF data in shuffle phase
> ---------------------------------------------------
>
> Key: SQOOP-1936
> URL: https://issues.apache.org/jira/browse/SQOOP-1936
> Project: Sqoop
> Issue Type: Sub-task
> Reporter: Veena Basavaraj
> Assignee: Veena Basavaraj
> Fix For: 1.99.5
>
> Attachments: SQOOP-1936-v2.patch, SQOOP-1936-v3.patch,
> SQOOP-1936.patch
>
>
> The compareTo method in SqoopWritable, shown below, compares the entire CSV
> string, so the order of elements in the output depends on their CSV text.
> For instance, if we have 20 records in a JDBC data source, the output will not
> be ordered by primary key. This matters especially for delta merges, where we
> have to be cognizant that the ordering of records in the output is controlled
> by their CSV representation.
> {code}
> String getString() {
>   return toIDF.getCSVTextData();
> }
>
> @Override
> public int compareTo(SqoopWritable o) {
>   return getString().compareTo(o.getString());
> }
> {code}
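
To illustrate the ordering problem described above, here is a minimal standalone sketch. It does not use any Sqoop classes: the class name, both helper methods, and the sample records are hypothetical, and the records only assume a CSV layout whose first column is an integer key. It compares sorting on the whole CSV string, as the compareTo above does, with sorting on the parsed key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

public class CsvOrderingSketch {

    // Hypothetical helper: orders records the way String.compareTo does,
    // i.e. lexicographically on the entire CSV text.
    static List<String> sortByCsvText(List<String> records) {
        List<String> sorted = new ArrayList<>(records);
        Collections.sort(sorted);
        return sorted;
    }

    // Hypothetical helper: orders records by the first column parsed as a
    // number. Assumes the first column is an integer key with no quoting.
    static List<String> sortByNumericKey(List<String> records) {
        List<String> sorted = new ArrayList<>(records);
        sorted.sort(Comparator.comparingLong(
            (String r) -> Long.parseLong(r.split(",", 2)[0])));
        return sorted;
    }

    public static void main(String[] args) {
        List<String> records = Arrays.asList("2,'bob'", "10,'carol'", "1,'alice'");
        // Text order puts key 10 before key 2, because ',' sorts before '0'
        // and '1' sorts before '2'.
        System.out.println(sortByCsvText(records));
        System.out.println(sortByNumericKey(records));
    }
}
```

With these sample records, the text-based sort yields keys in the order 1, 10, 2, while the key-based sort yields 1, 2, 10. This is only a sketch of the symptom, not of the change made in the attached patches.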
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)