date:20230907

[jira] [Created] (HADOOP-18882) HDFS defaults tls cipher to "no encryption" when keystore key is unset or empty

2023-09-07 Thread Jira

Sönke Liebau created HADOOP-18882:
-

 Summary: HDFS defaults tls cipher to "no encryption" when keystore 
key is unset or empty
 Key: HADOOP-18882
 URL: https://issues.apache.org/jira/browse/HADOOP-18882
 Project: Hadoop Common
  Issue Type: Bug
  Components: security
Affects Versions: 3.3.4
 Environment: We saw this issue when running in a Kubernetes 
environment.

Hadoop was deployed using the [Stackable Operator for Apache 
Hadoop|https://github.com/stackabletech/hdfs-operator]. 

The binaries contained in the deployed images are taken from the ASF mirrors, 
not self-compiled.
Reporter: Sönke Liebau


It looks like some HDFS servers default to a cipher suite that does not encrypt 
traffic when the keystore password is unset or set to an empty string.

Historically this has probably rarely been an issue, as Java's `keytool` 
refuses to create a keystore with a password shorter than 6 characters, so 
people would usually need to set passwords on their keystores (and hence in the 
config).

When using keystores without a password, we noticed that HDFS refuses to load 
keys from the keystore when `ssl.server.keystore.password` is unset or set to 
an empty string. Instead of erroring out, it sets the cipher suite for RPC 
connections to `TLS_NULL_WITH_NULL_NULL`, which is basically TLS without any 
encryption.

The impact varies depending on the communication channel. What we saw was:
 * JournalNodes seem to happily go along with this, and NameNodes equally 
happily connect to the JournalNodes without any warnings - we do have TLS 
enabled after all :)
 * NameNodes refuse connections with a handshake exception, so the real-world 
impact should hopefully be small, but this still seems like less than ideal 
behavior.
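
As an illustration of the failure mode described above, here is a minimal 
sketch of a check that refuses a TLS session whose negotiated suite carries no 
encryption, instead of silently continuing. The class `CipherSuiteGuard` and 
its methods are hypothetical and not part of Hadoop; matching on the suite name 
is an assumption that relies on the standard naming scheme, where NULL bulk 
encryption appears as `_WITH_NULL_`.

```java
import java.util.Locale;
import javax.net.ssl.SSLSession;

/** Hypothetical helper, not part of Hadoop: rejects TLS sessions without encryption. */
final class CipherSuiteGuard {

    private CipherSuiteGuard() {
    }

    /**
     * Returns true when the suite name declares NULL bulk encryption,
     * for example TLS_NULL_WITH_NULL_NULL or TLS_RSA_WITH_NULL_SHA256.
     */
    static boolean isUnencrypted(String cipherSuite) {
        if (cipherSuite == null) {
            return true; // no negotiated suite at all: treat as unsafe
        }
        return cipherSuite.toUpperCase(Locale.ROOT).contains("_WITH_NULL_");
    }

    /** Throws instead of silently continuing on an unencrypted session. */
    static void requireEncrypted(SSLSession session) {
        String suite = session.getCipherSuite();
        if (isUnencrypted(suite)) {
            throw new IllegalStateException("Refusing unencrypted TLS session: " + suite);
        }
    }

    public static void main(String[] args) {
        System.out.println(isUnencrypted("TLS_NULL_WITH_NULL_NULL")); // prints true
        System.out.println(isUnencrypted("TLS_AES_128_GCM_SHA256"));  // prints false
    }
}
```

A check like this would belong right after the handshake completes, so that a 
misconfigured keystore surfaces as an error rather than as plaintext traffic.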



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

[jira] [Created] (HADOOP-18883) Expect-100 JDK bug resolution: prevent multiple server calls

2023-09-07 Thread Pranav Saxena (Jira)

Pranav Saxena created HADOOP-18883:
--

 Summary: Expect-100 JDK bug resolution: prevent multiple server 
calls
 Key: HADOOP-18883
 URL: https://issues.apache.org/jira/browse/HADOOP-18883
 Project: Hadoop Common
  Issue Type: Sub-task
  Components: fs/azure
Reporter: Pranav Saxena
Assignee: Pranav Saxena
 Fix For: 3.4.0


This is in line with JDK bug: [https://bugs.openjdk.org/browse/JDK-8314978].

With the current implementation of HttpURLConnection, if the server rejects the 
“Expect 100-continue” request, a ‘java.net.ProtocolException’ is thrown from 
the 'expect100Continue()' method.

After the exception is thrown, if we call any other method on the same instance 
(e.g. getHeaderField() or getHeaderFields()), it will internally call 
getOutputStream(), which invokes writeRequests(), which makes the actual server 
call. 

In AbfsHttpOperation, after sendRequest() we call the processResponse() method 
from AbfsRestOperation. Even if conn.getOutputStream() fails due to the 
expect-100 error, we consume the exception and let the code go ahead. So 
getHeaderField() / getHeaderFields() / getHeaderFieldLong() can be triggered 
after getOutputStream() has failed. These invocations will lead to server 
calls.
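
One possible way to avoid the extra server calls described above is to remember 
that opening the output stream failed and to skip header lookups afterwards. 
The sketch below is an assumption for illustration only, not the ABFS 
implementation: `Expect100Guard`, its `Connection` interface, and 
`headerOrNull()` are hypothetical names, and `Connection` merely stands in for 
the relevant parts of HttpURLConnection.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.ProtocolException;

/** Hypothetical sketch, not the ABFS implementation. */
final class Expect100Guard {

    /** Minimal stand-in for the parts of HttpURLConnection discussed above. */
    interface Connection {
        void openOutputStream() throws IOException;

        String getHeaderField(String name);
    }

    private final Connection conn;
    private boolean outputStreamFailed;

    Expect100Guard(Connection conn) {
        this.conn = conn;
    }

    /** Opens the output stream and remembers an expect-100 rejection. */
    void sendRequest() {
        try {
            conn.openOutputStream();
        } catch (ProtocolException e) {
            outputStreamFailed = true; // consumed, as in the flow described above
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Reads a header only if doing so cannot trigger a second server call. */
    String headerOrNull(String name) {
        return outputStreamFailed ? null : conn.getHeaderField(name);
    }

    public static void main(String[] args) {
        int[] serverCalls = {0};
        Connection rejecting = new Connection() {
            @Override
            public void openOutputStream() throws IOException {
                throw new ProtocolException("Server rejected operation");
            }

            @Override
            public String getHeaderField(String name) {
                serverCalls[0]++; // a real connection would re-send the request here
                return "value";
            }
        };
        Expect100Guard op = new Expect100Guard(rejecting);
        op.sendRequest();
        op.headerOrNull("x-ms-request-id");
        System.out.println("server calls after failure: " + serverCalls[0]); // prints 0
    }
}
```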




[jira] [Resolved] (HADOOP-18447) Vectored IO: Threadpool should be closed on interrupts or during close calls

2023-09-07 Thread Steve Loughran (Jira)



 [ 
https://issues.apache.org/jira/browse/HADOOP-18447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran resolved HADOOP-18447.
-
Resolution: Duplicate

HADOOP-18347 uses a bounded pool, which is shut down in fs.close().

> Vectored IO: Threadpool should be closed on interrupts or during close calls
> 
>
> Key: HADOOP-18447
> URL: https://issues.apache.org/jira/browse/HADOOP-18447
> Project: Hadoop Common
>  Issue Type: Sub-task
>  Components: common, fs, fs/adl, fs/s3
>Affects Versions: 3.3.5
>Reporter: Rajesh Balamohan
>Priority: Major
>  Labels: performance, stability
> Attachments: Screenshot 2022-09-08 at 9.22.07 AM.png
>
>
> The vectored IO threadpool should be closed on any interrupt or during 
> S3AFileSystem/S3AInputStream close() calls.
> E.g. a query was cancelled in the middle of its run, but the vectored IO 
> threads continued to run in the background (e.g. in LLAP).
>  
> !Screenshot 2022-09-08 at 9.22.07 AM.png|width=537,height=164!
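
The behaviour requested in the quoted issue, where worker threads stop once the 
owning stream or filesystem is closed, can be sketched with a plain JDK 
executor. This is only an illustration under assumptions: `RangeReadPool` is a 
hypothetical class, not S3AFileSystem code, and the 5 second wait is an 
arbitrary choice.

```java
import java.io.Closeable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch, not S3AFileSystem code. */
final class RangeReadPool implements Closeable {

    private final ExecutorService pool;

    RangeReadPool(int threads) {
        this.pool = Executors.newFixedThreadPool(threads);
    }

    Future<?> submit(Runnable read) {
        return pool.submit(read);
    }

    /** Interrupts in-flight reads and waits briefly for the workers to exit. */
    @Override
    public void close() {
        pool.shutdownNow();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt
        }
    }

    boolean isTerminated() {
        return pool.isTerminated();
    }

    public static void main(String[] args) {
        RangeReadPool reads = new RangeReadPool(2);
        reads.submit(() -> {
            try {
                Thread.sleep(60_000); // stands in for a long-running range read
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        reads.close();
        System.out.println("terminated: " + reads.isTerminated());
    }
}
```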




[jira] [Created] (HADOOP-18884) [ABFS] Support VectorIO in ABFS Input Stream

2023-09-07 Thread Steve Loughran (Jira)

Steve Loughran created HADOOP-18884:
---

 Summary: [ABFS] Support VectorIO in ABFS Input Stream
 Key: HADOOP-18884
 URL: https://issues.apache.org/jira/browse/HADOOP-18884
 Project: Hadoop Common
  Issue Type: Sub-task
  Components: fs/azure
Affects Versions: 3.3.9
Reporter: Steve Loughran


The Hadoop vector IO APIs are supported in file:// and s3a://; there's a Hive 
ORC patch for this, and PARQUET-2171 adds it for Parquet, after which all apps 
using the library with a matching Hadoop version and the feature enabled will 
get a significant speedup.

abfs needs to support this too, which requires support for parallel GET 
requests for different ranges.
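
To illustrate what "parallel GET requests for different ranges" means, here is 
a minimal sketch using only the JDK. It is not the ABFS or Hadoop vector IO 
API: `ParallelRangeReader` and `RangeFetcher` are hypothetical names, and the 
in-memory byte array in `main` merely stands in for a remote object.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

/** Hypothetical sketch of parallel range reads; not the ABFS or Hadoop API. */
final class ParallelRangeReader {

    /** Stand-in for one ranged GET against a remote object. */
    interface RangeFetcher {
        byte[] fetch(long offset, int length);
    }

    private ParallelRangeReader() {
    }

    /**
     * Issues one fetch per {offset, length} pair on the pool and returns
     * the futures in the same order as the requested ranges.
     */
    static List<CompletableFuture<byte[]>> readRanges(
            RangeFetcher fetcher, long[][] ranges, ExecutorService pool) {
        List<CompletableFuture<byte[]>> results = new ArrayList<>();
        for (long[] range : ranges) {
            long offset = range[0];
            int length = (int) range[1];
            results.add(CompletableFuture.supplyAsync(
                    () -> fetcher.fetch(offset, length), pool));
        }
        return results;
    }

    public static void main(String[] args) {
        byte[] object = "0123456789abcdefghij".getBytes(StandardCharsets.US_ASCII);
        RangeFetcher inMemory = (offset, length) ->
                Arrays.copyOfRange(object, (int) offset, (int) offset + length);
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            long[][] ranges = {{0, 4}, {10, 5}};
            for (CompletableFuture<byte[]> f : readRanges(inMemory, ranges, pool)) {
                // prints "0123", then "abcde"
                System.out.println(new String(f.join(), StandardCharsets.US_ASCII));
            }
        } finally {
            pool.shutdown();
        }
    }
}
```

Returning one future per range lets the caller consume whichever range arrives 
first, which is where the speedup for columnar formats would come from.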




Apache Hadoop qbt Report: trunk+JDK8 on Linux/x86_64

2023-09-07 Thread Apache Jenkins Server

For more details, see 
https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/

[Sep 6, 2023, 2:27:52 AM] (github) HDFS-16933. A race in SerialNumberMap will 
cause wrong owner, group and XATTR. (#5430). Contributed by ZanderXu.
[Sep 6, 2023, 12:19:04 PM] (github) HDFS-17140. Revisit the 
BPOfferService.reportBadBlocks() method. (#5924). Contributed by Liangjun He.
[Sep 6, 2023, 2:08:59 PM] (github) YARN-11537. [Federation] Router CLI Supports 
List SubClusterPolicyConfiguration Of Queues. (#5944) Contributed by Shilun Fan.
[Sep 6, 2023, 4:17:12 PM] (github) HDFS-17167. Add config to startup NameNode 
as Observer (#6013)
[Sep 7, 2023, 1:32:36 AM] (Szilard Nemeth) HADOOP-18870. CURATOR-599 change 
broke functionality introduced in HADOOP-18139 and HADOOP-18709. Contributed by 
Ferenc Erdelyi




-1 overall


The following subsystems voted -1:
blanks hadolint pathlen unit xml


The following subsystems voted -1 but
were configured to be filtered/ignored:
cc checkstyle javac javadoc pylint shellcheck


The following subsystems are considered long running:
(runtime bigger than 1h  0m  0s)
unit


Specific tests:

XML :

   Parsing Error(s): 
   hadoop-common-project/hadoop-common/src/test/resources/xml/external-dtd.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-excerpt.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags2.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-sample-output.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/fair-scheduler-invalid.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/yarn-site-with-invalid-allocation-file-ref.xml 

Failed junit tests :

   hadoop.mapreduce.v2.TestUberAM 
   hadoop.mapreduce.v2.TestMRJobsWithProfiler 
   hadoop.mapreduce.v2.TestMRJobs 
  

   cc:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-compile-cc-root.txt [96K]

   javac:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-compile-javac-root.txt [12K]

   blanks:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/blanks-eol.txt [15M]
      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/blanks-tabs.txt [2.0M]

   checkstyle:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-checkstyle-root.txt [13M]

   hadolint:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-hadolint.txt [20K]

   pathlen:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-pathlen.txt [16K]

   pylint:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-pylint.txt [20K]

   shellcheck:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-shellcheck.txt [24K]

   xml:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/xml.txt [24K]

   javadoc:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/results-javadoc-javadoc-root.txt [244K]

   unit:

      https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/1342/artifact/out/patch-unit-hadoop-mapreduce-project_hadoop-mapreduce-client_hadoop-mapreduce-client-jobclient.txt [72K]

Powered by Apache Yetus 0.14.0-SNAPSHOT   https://yetus.apache.org


Apache Hadoop qbt Report: branch-3.3+JDK8 on Linux/x86_64

2023-09-07 Thread Apache Jenkins Server

For more details, see 
https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/

[Aug 31, 2023, 2:00:05 PM] (Steve Loughran) HADOOP-18328. S3A to support S3 on 
Outposts (#4533)




-1 overall


The following subsystems voted -1:
blanks pathlen unit xml


The following subsystems voted -1 but
were configured to be filtered/ignored:
cc checkstyle javac javadoc pylint shellcheck


The following subsystems are considered long running:
(runtime bigger than 1h  0m  0s)
unit


Specific tests:

XML :

   Parsing Error(s): 
   hadoop-common-project/hadoop-common/src/test/resources/xml/external-dtd.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-excerpt.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags2.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-sample-output.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/fair-scheduler-invalid.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/yarn-site-with-invalid-allocation-file-ref.xml 

Failed junit tests :

   hadoop.hdfs.server.datanode.TestDirectoryScanner 
   hadoop.yarn.server.nodemanager.containermanager.logaggregation.TestLogAggregationService 
   hadoop.yarn.sls.appmaster.TestAMSimulator 
  

   cc:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-compile-cc-root.txt [48K]

   javac:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-compile-javac-root.txt [380K]

   blanks:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/blanks-eol.txt [15M]
      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/blanks-tabs.txt [2.0M]

   checkstyle:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-checkstyle-root.txt [14M]

   pathlen:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-pathlen.txt [16K]

   pylint:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-pylint.txt [20K]

   shellcheck:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-shellcheck.txt [20K]

   xml:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/xml.txt [28K]

   javadoc:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/results-javadoc-javadoc-root.txt [972K]

   unit:

      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt [536K]
      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-nodemanager.txt [96K]
      https://ci-hadoop.apache.org/job/hadoop-qbt-branch-3.3-java8-linux-x86_64/125/artifact/out/patch-unit-hadoop-tools_hadoop-sls.txt [16K]

Powered by Apache Yetus 0.14.0-SNAPSHOT   https://yetus.apache.org


Apache Hadoop qbt Report: branch-2.10+JDK7 on Linux/x86_64

2023-09-07 Thread Apache Jenkins Server

For more details, see 
https://ci-hadoop.apache.org/job/hadoop-qbt-branch-2.10-java7-linux-x86_64/1143/

No changes


ERROR: File 'out/email-report.txt' does not exist


Apache Hadoop qbt Report: trunk+JDK11 on Linux/x86_64

2023-09-07 Thread Apache Jenkins Server

For more details, see 
https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java11-linux-x86_64/548/

[Sep 5, 2023, 9:34:05 AM] (github) HADOOP-18880. Add some rpc related metrics 
to Metrics.md (#6015) Contributed by Yanghai Hu.
[Sep 6, 2023, 12:48:27 AM] (github) HDFS-17166. RBF: Throwing 
NoNamenodesAvailableException for a long time, when failover (#5990)
[Sep 6, 2023, 2:27:52 AM] (github) HDFS-16933. A race in SerialNumberMap will 
cause wrong owner, group and XATTR. (#5430). Contributed by ZanderXu.
[Sep 6, 2023, 12:19:04 PM] (github) HDFS-17140. Revisit the 
BPOfferService.reportBadBlocks() method. (#5924). Contributed by Liangjun He.
[Sep 6, 2023, 2:08:59 PM] (github) YARN-11537. [Federation] Router CLI Supports 
List SubClusterPolicyConfiguration Of Queues. (#5944) Contributed by Shilun Fan.
[Sep 6, 2023, 4:17:12 PM] (github) HDFS-17167. Add config to startup NameNode 
as Observer (#6013)
[Sep 7, 2023, 1:32:36 AM] (Szilard Nemeth) HADOOP-18870. CURATOR-599 change 
broke functionality introduced in HADOOP-18139 and HADOOP-18709. Contributed by 
Ferenc Erdelyi




-1 overall


The following subsystems voted -1:
blanks hadolint mvnsite pathlen spotbugs unit xml


The following subsystems voted -1 but
were configured to be filtered/ignored:
cc checkstyle javac javadoc pylint shellcheck


The following subsystems are considered long running:
(runtime bigger than 1h  0m  0s)
unit


Specific tests:

XML :

   Parsing Error(s): 
   hadoop-common-project/hadoop-common/src/test/resources/xml/external-dtd.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-excerpt.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-output-missing-tags2.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/resources/nvidia-smi-sample-output.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/fair-scheduler-invalid.xml 
   hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/resources/yarn-site-with-invalid-allocation-file-ref.xml 

spotbugs :

   module:hadoop-hdfs-project/hadoop-hdfs 
   Redundant nullcheck of oldLock, which is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.DataStorage.isPreUpgradableLayout(Storage$StorageDirectory) 
Redundant null check at DataStorage.java:[line 695] 
   Redundant nullcheck of metaChannel, which is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.MappableBlockLoader.verifyChecksum(long, 
FileInputStream, FileChannel, String) Redundant null check at 
MappableBlockLoader.java:[line 138] 
   Redundant nullcheck of blockChannel, which is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.MemoryMappableBlockLoader.load(long, 
FileInputStream, FileInputStream, String, ExtendedBlockId) Redundant null 
check at MemoryMappableBlockLoader.java:[line 75] 
   Redundant nullcheck of blockChannel, which is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.NativePmemMappableBlockLoader.load(long, 
FileInputStream, FileInputStream, String, ExtendedBlockId) Redundant null 
check at NativePmemMappableBlockLoader.java:[line 85] 
   Redundant nullcheck of metaChannel, which is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.NativePmemMappableBlockLoader.verifyChecksumAndMapBlock(NativeIO$POSIX$$PmemMappedRegion,,
 long, FileInputStream, FileChannel, String) Redundant null check at 
NativePmemMappableBlockLoader.java:is known to be non-null in 
org.apache.hadoop.hdfs.server.datanode.fsdataset.impl.NativePmemMappableBlockLoader.verifyChecksumAndMapBlock(NativeIO$POSIX$$PmemMappedRegion,,
 long, FileInputStream, FileChannel, Stri
