Hi, I'm having problems deploying the kylin4_on_cloud project located at:
https://github.com/apache/kylin/tree/kylin4_on_cloud

I've also been following the instructions here
https://www.youtube.com/watch?v=5kKXEMjO1Sc&ab_channel=Kyligence

I used windows to git clone the repo and set up the venv with the latest
packages via:
pip install PyYAML
pip install boto3
pip install botocore
pip install pyparsing
pip install requests
pip install retrying
pip install Jinja2
pip install pytest-shutil
:
I also changed the RDSEngineVersion to 8.0.35 in kylin_configs.yaml, as
RDSEngineVersion 5.7.25 (default repo version) was giving me the error
"Exception: Current stack: ec2-rds-stack is create failed, please check".

Here's the log with error I am now getting:

==========================================================================
(venv) C:\projects\kylin4_on_cloud>python deploy.py --type deploy --mode job
2024-02-07 02:13:54 - botocore.credentials - INFO - 5484 - Found
credentials in shared credentials file: ~/.aws/credentials
2024-02-07 02:13:57 - engine - INFO - 5484 - Env already inited, skip init
again.
2024-02-07 02:13:58 - clouds.aws - WARNING - 5484 - Current env for
deploying a cluster is not ready.
2024-02-07 02:14:20 - instances.aws_instance - INFO - 5484 - Now creating
stack: ec2-or-emr-vpc-stack.
2024-02-07 02:16:42 - instances.aws_instance - INFO - 5484 - Now creating
stack: ec2-rds-stack.
2024-02-07 02:21:06 - instances.aws_instance - INFO - 5484 - Now creating
stack: ec2-static-service-stack.
2024-02-07 02:21:06 - engine - INFO - 5484 - First launch default Kylin
Cluster.
2024-02-07 02:22:08 - clouds.aws - WARNING - 5484 - Current cluster is not
ready.
2024-02-07 02:22:30 - instances.aws_instance - INFO - 5484 - Now creating
stack: ec2-zookeeper-stack.
2024-02-07 02:23:43 - instances.aws_instance - INFO - 5484 - Current
execute commands in `Zookeeper stack` which named ec2-zookeeper-stack.
2024-02-07 02:23:43 - instances.aws_instance - INFO - 5484 - Current
instance id: i-0cbc37f83c9cda006 is executing commands: grep -Fq
"10.1.0.133:2888:3888" /home/ec2-user/hadoop/zookeeper/conf/zoo.cfg; echo
$?.
2024-02-07 02:23:49 - instances.aws_instance - INFO - 5484 - Current
instance id: i-0915d44c700e644dc is executing commands: grep -Fq
"10.1.0.129:2888:3888" /home/ec2-user/hadoop/zookeeper/conf/zoo.cfg; echo
$?.
2024-02-07 02:23:54 - instances.aws_instance - INFO - 5484 - Current
instance id: i-0fdbacc22ecae360a is executing commands: grep -Fq
"10.1.0.58:2888:3888" /home/ec2-user/hadoop/zookeeper/conf/zoo.cfg; echo $?.
2024-02-07 02:24:00 - instances.aws_instance - INFO - 5484 - Current
instance id: i-0cbc37f83c9cda006 is executing commands: echo
'server.1=10.1.0.133:2888:3888
server.2=10.1.0.129:2888:3888
server.3=10.1.0.58:2888:3888' >>
/home/ec2-user/hadoop/zookeeper/conf/zoo.cfg.
2024-02-07 02:24:05 - instances.aws_instance - WARNING - 5484 -
{'CommandId': '704b776f-e574-47ea-bf13-30d3be2e9df2', 'InstanceId':
'i-0cbc37f83c9cda006', 'Comment': '', 'DocumentName': 'AWS-RunShellScript',
'DocumentVersion': '$DEFAULT', 'PluginName': 'aws:runShellScript',
'ResponseCode': 1,
'ExecutionStartDateTime': '2024-02-06T16:24:00.394Z',
'ExecutionElapsedTime': 'PT0.008S', 'ExecutionEndDateTime':
'2024-02-06T16:24:00.394Z', 'Status': 'Failed', 'StatusDetails': 'Failed',
'StandardOutputContent': '', 'StandardOutputUrl': '',
'StandardErrorContent':
'/var/lib/amazon/ssm/i-0cbc37f83c9cda006/document/orchestration/704b776f-e574-47ea-bf13-30d3be2e9df2/awsrunShellScript/0.awsrunShellScript/_script.sh:
line 3: /home/ec2-user/hadoop/zookeeper/conf/zoo.cfg: No such file or
directory\nfailed to run commands: exit status 1', 'StandardErrorUrl': '',
'CloudWatchOutputConfig': {'CloudWatchLogGroupName': '',
'CloudWatchOutputEnabled': False}, 'ResponseMetadata': {'RequestId':
'133ea7d8-d661-4ea0-960d-349b294dd8a9', 'HTTPStatusCode': 200,
'HTTPHeaders': {'server': 'Server', 'date': 'Tue, 06 Feb 2024 16:24:05
GMT', 'content-type': 'application/x-amz-json-1.1', 'content-length':
'848', 'connection': 'keep-alive', 'x-amzn-requestid':
'133ea7d8-d661-4ea0-960d-349b294dd8a9'}, 'RetryAttempts': 0}}
Traceback (most recent call last):
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\deploy.py", line 141,
in <module>
    deploy_on_aws(args.type, args.kylin_mode, args.scale_type,
args.node_type, args.cluster)
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\deploy.py", line 63,
in deploy_on_aws
    aws_engine.launch_default_cluster()
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\engine.py", line 38,
in launch_default_cluster
    self.engine_utils.launch_default_cluster()
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\utils\engine_utils.py", line
101, in launch_default_cluster
    cloud_addr = self.get_kylin_address()
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\utils\engine_utils.py", line
217, in get_kylin_address
    kylin_address = self.aws.get_kylin_address()
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\clouds\aws.py", line
149, in get_kylin_address
    kylin_resources = self.get_kylin_resources()
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\clouds\aws.py", line
157, in get_kylin_resources
    self.init_cluster()
  File "C:\myfiles\_clients\me\kylin\kylin4_on_cloud\clouds\aws.py", line
137, in init_cluster
    self.cloud_instance.after_create_zk_cluster()
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\instances\aws_instance.py",
line 551, in after_create_zk_cluster
    self.after_create_zk_of_target_cluster()
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\instances\aws_instance.py",
line 569, in after_create_zk_of_target_cluster
    self.refresh_zks_cfg(zk_ips=zk_ips, zk_ids=zk_ids)
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\instances\aws_instance.py",
line 582, in refresh_zks_cfg
    self.exec_script_instance_and_return(name_or_id=zk_id,
script=refresh_command)
  File
"C:\myfiles\_clients\me\kylin\kylin4_on_cloud\instances\aws_instance.py",
line 1978, in exec_script_instance_and_return
    assert output and output['Status'] == 'Success', \
AssertionError: execute script failed, failed details message:
{'CommandId': '704b776f-e574-47ea-bf13-30d3be2e9df2', 'InstanceId':
'i-0cbc37f83c9cda006', 'Comment': '', 'DocumentName': 'AWS-RunShellScript',
'DocumentVersion': '$DEFAULT', 'PluginName': 'aws:runShellScript',
'ResponseCode': 1, 'ExecutionStartDateTime': '2024-02-06T16:24:00.394Z',
'ExecutionElapsedTime': 'PT0.008S', 'ExecutionEndDateTime':
'2024-02-06T16:24:00.394Z', 'Status': 'Failed', 'StatusDetails': 'Failed',
'StandardOutputContent': '', 'StandardOutputUrl': '',
'StandardErrorContent':
'/var/lib/amazon/ssm/i-0cbc37f83c9cda006/document/orchestration/704b776f-e574-47ea-bf13-30d3be2e9df2/awsrunShellScript/0.awsrunShellScript/_script.sh:
line 3: /home/ec2-user/hadoop/zookeeper/conf/zoo.cfg: No such file or
directory\nfailed to run commands: exit status 1', 'StandardErrorUrl': '',
'CloudWatchOutputConfig':
{'CloudWatchLogGroupName': '', 'CloudWatchOutputEnabled': False},
'ResponseMetadata': {'RequestId': '133ea7d8-d661-4ea0-960d-349b294dd8a9',
'HTTPStatusCode': 200, 'HTTPHeaders': {'server': 'Server', 'date': 'Tue, 06
Feb 2024 16:24:05 GMT', 'content-type': 'application/x-amz-json-1.1',
'content-length': '848', 'connection': 'keep-alive', 'x-amzn-requestid':
'133ea7d8-d661-4ea0-960d-349b294dd8a9'}, 'RetryAttempts': 0}}
==========================================================================

I checked on zk ec2 instances and there is no /home/ec2-user/hadoop
directory.

I also noticed that the file "prepare-ec2-env-for-zk.sh" has been uploaded
to the home directory on all zk ec2 nodes:
---------------------------------------------------------------------
drwx------ 3 ec2-user ec2-user   107 Feb  5 17:58 .
drwxr-xr-x 4 root     root        38 Feb  5 17:57 ..
-rw-r--r-- 1 ec2-user ec2-user    18 Jul 27  2018 .bash_logout
-rw-r--r-- 1 ec2-user ec2-user   193 Jul 27  2018 .bash_profile
-rw-r--r-- 1 ec2-user ec2-user   231 Jul 27  2018 .bashrc
drwx------ 2 ec2-user ec2-user    29 Feb  5 17:57 .ssh
-rw-r--r-- 1 root     root     10119 Feb  4 23:40 prepare-ec2-env-for-zk.sh
---------------------------------------------------------------------

I guess the deploy script hasn't been able to execute the
"prepare-ec2-env-for-zk.sh" script on the three zk ec2 nodes? I'm not sure
what else could be wrong.

Can someone please tell me how to get the deployment working?

Thanks.

Reply via email to