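It would be best to post both the DDL and the INSERT INTO statements so we can take a look.

-----Original Message-----
From: 陈奇昌 <1879467...@qq.com.INVALID>
Sent: November 10, 2022 17:29
To: dev <dev@doris.apache.org>
Subject: BE nodes fail to start after crashing
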
Hi,

I am using apache-doris-be-1.1.2. I wrote a demo that loops 60,000 times and executes a single INSERT INTO statement on each iteration. After it ran, all 3 nodes went down, and they now stop immediately after being started. Setting disable_auto_compaction=true lets them start, but it does not fix the root cause.

The be.out log is as follows:

start time: Thu Nov 10 17:11:43 CST 2022
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1110 17:11:43.721869 4706 env.cpp:46] Env init successfully.
*** Aborted at 1668071504 (unix time) try "date -d @1668071504" if you are using GNU date ***
*** SIGSEGV unkown detail explain (@0x0) received by PID 4706 (TID 0x7f0bf7546700) from PID 0; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /mnt/disk2/apache-doris/be/src/common/signal_handler.h:420
 1# 0x00007F0C5595C400 in /lib64/libc.so.6
 2# doris::BaseFieldtypeTraits<(doris::FieldType)9>::equal(void const*, void const*) at /mnt/disk2/apache-doris/be/src/olap/types.h:491
 3# doris::TupleReader::_unique_key_next_row(doris::RowCursor*, doris::MemPool*, doris::ObjectPool*, bool*) at /mnt/disk2/apache-doris/be/src/olap/tuple_reader.cpp:197
 4# doris::Merger::merge_rowsets(std::shared_ptr<doris::Tablet>, doris::ReaderType, std::vector<std::shared_ptr<doris::RowsetReader>, std::allocator<std::shared_ptr<doris::RowsetReader> > > const&, doris::RowsetWriter*, doris::Merger::Statistics*) in /home/cv/apache-doris-be-1.1.2/be/lib/doris_be
 5# doris::Compaction::do_compaction_impl(long) in /home/cv/apache-doris-be-1.1.2/be/lib/doris_be
 6# doris::Compaction::do_compaction(long) at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:112
 7# doris::CumulativeCompaction::execute_compact_impl() in /home/cv/apache-doris-be-1.1.2/be/lib/doris_be
 8# doris::Compaction::execute_compact() at /mnt/disk2/apache-doris/be/src/olap/compaction.cpp:50
 9# doris::Tablet::execute_compaction(doris::CompactionType) in /home/cv/apache-doris-be-1.1.2/be/lib/doris_be
10# std::_Function_handler<void (), doris::StorageEngine::_submit_compaction_task(std::shared_ptr<doris::Tablet>, doris::CompactionType)::$_12>::_M_invoke(std::_Any_data const&) at /mnt/disk2/ygl/installs/ldbtools/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
11# doris::ThreadPool::dispatch_thread() at /mnt/disk2/apache-doris/be/src/util/threadpool.cpp:578
12# doris::Thread::supervise_thread(void*) at /mnt/disk2/apache-doris/be/src/util/thread.cpp:407
13# start_thread in /lib64/libpthread.so.0
14# clone in /lib64/libc.so.6

The be.INFO log is as follows:

I1110 17:11:43.801707 4706 daemon.cpp:240] version 1.1.2-rc05 RELEASE (build git://hk-dev01/mnt/disk2/apache-doris@a8323dae4f93cc4653b9b071607090449208fd7c) Built on Fri, 09 Sep 2022 18:12:12 CST by ygl@hk-dev01
I1110 17:11:43.806525 4706 mem_info.cpp:89] Physical Memory: 31.26 GB
I1110 17:11:43.810570 4706 daemon.cpp:272] Cpu Info:
  Model: Intel(R) Xeon(R) Silver 4208 CPU @ 2.10GHz
  Cores: 32
  Max Possible Cores: 32
  L1 Cache: 32.00 KB (Line: 64.00 B)
  L2 Cache: 1.00 MB (Line: 64.00 B)
  L3 Cache: 11.00 MB (Line: 64.00 B)
  Hardware Supports: ssse3 sse4_1 sse4_2 popcnt avx avx2
  Numa Nodes: 2
  Numa Nodes of Cores: 0->0 | 1->0 | 2->0 | 3->0 | 4->0 | 5->0 | 6->0 | 7->0 | 8->0 | 9->0 | 10->0 | 11->0 | 12->0 | 13->0 | 14->0 | 15->0 | 16->1 | 17->1 | 18->1 | 19->1 | 20->1 | 21->1 | 22->1 | 23->1 | 24->1 | 25->1 | 26->1 | 27->1 | 28->1 | 29->1 | 30->1 | 31->1 |
I1110 17:11:43.810644 4706 daemon.cpp:273] Disk Info: Num disks 3: sda, sr, dm-
I1110 17:11:43.810652 4706 daemon.cpp:274] Physical Memory: 31.26 GB Memory Limt: 25.01 GB Current Usage: 0 CGroup Info: Process CGroup Info: memory.limit_in_bytes=9223372036854771712, cpu cfs limits: unlimited
I1110 17:11:43.811475 4706 backend_options.cpp:88] priority cidrs in conf: 10.10.11.0/24
I1110 17:11:43.811609 4706 backend_options.cpp:76] local host ip=10.10.11.151
I1110 17:11:43.815408 4706 exec_env_init.cpp:118] scan thread pool use PriorityWorkStealingThreadPool
I1110 17:11:43.816560 4771 fragment_mgr.cpp:699] FragmentMgr cancel worker start working.
I1110 17:11:43.873380 4706 load_path_mgr.cpp:58] Load path configured to [/home/cv/apache-doris-be-1.1.2/be/storage/mini_download]
I1110 17:11:43.873428 4848 result_buffer_mgr.cpp:142] result buffer manager cancel thread begin.
I1110 17:11:43.873529 4706 exec_env_init.cpp:223] Using global memory limit: 25.01 GB, origin config value: 80%
I1110 17:11:43.873634 4706 exec_env_init.cpp:264] Buffer pool memory limit: 5.00 GB, origin config value: 20%. clean pages limit: 2.50 GB, origin config value: 50%
I1110 17:11:43.876905 4706 exec_env_init.cpp:280] Storage page cache memory limit: 5.00 GB, origin config value: 20%
I1110 17:11:43.878834 4706 tmp_file_mgr.cc:113] Using scratch directory /home/cv/apache-doris-be-1.1.2/be/storage/doris-scratch on disk 2
I1110 17:11:43.878902 4706 exec_env_init.cpp:309] Chunk allocator memory limit: 2.00 GB, origin config value: 2147483648
I1110 17:11:43.881217 4706 storage_engine.cpp:100] starting backend using uid:bf427e76d6265c55-0ff40281990f2189
I1110 17:11:43.881491 4854 data_dir.cpp:739] path: /home/cv/apache-doris-be-1.1.2/be/storage total capacity: 458232905728, available capacity: 380353990656
I1110 17:11:43.881717 4854 data_dir.cpp:204] path: /home/cv/apache-doris-be-1.1.2/be/storage, hash: 5426859093020019257
I1110 17:11:43.976120 4706 storage_engine.cpp:256] stream load record path: /home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.043159 4940 data_dir.cpp:386] start to load tablets from /home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.043560 4940 data_dir.cpp:379] successfully check incompatible old format meta /home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.043581 4940 data_dir.cpp:396] begin loading rowset from meta
I1110 17:11:44.043814 4940 data_dir.cpp:416] load rowset from meta finished, data dir: /home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.043828 4940 data_dir.cpp:421] begin loading tablet from meta
I1110 17:11:44.242991 4940 data_dir.cpp:470] load tablet from meta finished, loaded tablet: 4539, error tablet: 0, path: /home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.243028 4940 data_dir.cpp:543] finish to load tablets from /home/cv/apache-doris-be-1.1.2/be/storage, total rowset meta: 3, invalid rowset num: 0
I1110 17:11:44.244043 4706 storage_engine.cpp:104] success to init storage engine.
I1110 17:11:44.244122 4706 olap_server.cpp:50] unused rowset monitor thread started
I1110 17:11:44.244285 4706 olap_server.cpp:56] garbage sweeper thread started
I1110 17:11:44.244421 4706 olap_server.cpp:62] disk stat monitor thread started
I1110 17:11:44.248678 4706 olap_server.cpp:114] compaction tasks producer thread started
I1110 17:11:44.248697 4985 olap_server.cpp:426] try to start compaction producer process!
I1110 17:11:44.248818 4706 olap_server.cpp:128] tablet checkpoint tasks producer thread started
I1110 17:11:44.248914 4986 olap_server.cpp:331] begin to produce tablet meta checkpoint tasks.
I1110 17:11:44.248952 4706 olap_server.cpp:134] fd cache clean thread started
I1110 17:11:44.250069 4988 olap_server.cpp:312] try to perform path scan!
I1110 17:11:44.250088 4988 data_dir.cpp:660] start to scan data dir path:/home/cv/apache-doris-be-1.1.2/be/storage
I1110 17:11:44.250088 4706 olap_server.cpp:156] path scan/gc threads started. number:1
I1110 17:11:44.250108 4706 olap_server.cpp:159] all storage engine's background threads are started.
I1110 17:11:44.250173 4990 olap_server.cpp:287] try to start path gc thread!
I1110 17:11:44.250198 4990 olap_server.cpp:290] try to perform path gc by tablet!
I1110 17:11:44.254791 4974 compaction.cpp:138] start cumulative compaction. tablet=223773.1946636312.9142e80eeff35fb3-beb419325a9c37b4, output_version=[2-60], permits: 59
I1110 17:11:44.260146 4989 tablet_manager.cpp:1202] finish to do meta checkpoint on dir: /home/cv/apache-doris-be-1.1.2/be/storage, number: 0, cost(ms): 1
I1110 17:11:44.262894 4706 agent_server.cpp:99] Register user resource listener
I1110 17:11:44.262941 4706 backend_service.cpp:82] DorisInternalService listening on 9060
I1110 17:11:44.263602 4706 thrift_server.cpp:355] ThriftServer 'backend' started on port: 9060
I1110 17:11:44.270102 4973 compaction.cpp:138] start cumulative compaction. tablet=223777.1946636312.2449a4f3990d0cf0-5192af096b13689b, output_version=[2-60], permits: 59
I1110 17:11:44.285086 4706 server.cpp:1066] Server[doris::PInternalServiceImpl<doris::PBackendService>] is serving on port=8060.
I1110 17:11:44.285102 4706 server.cpp:1069] Check out http://localhost.localdomain:8060 in web browser.
I1110 17:11:44.309863 4706 thrift_server.cpp:355] ThriftServer 'heartbeat' started on port: 9050
I1110 17:11:44.350798 4988 data_dir.cpp:709] scan data dir path: /home/cv/apache-doris-be-1.1.2/be/storage finished. path size: 5914
I1110 17:11:44.351671 4990 data_dir.cpp:567] start to path gc by tablet schemahash.
I1110 17:11:44.410344 4990 data_dir.cpp:606] finished one time path gc by tablet.
I1110 17:11:44.410378 4990 olap_server.cpp:293] try to perform path gc by rowsetid!
I1110 17:11:44.410387 4990 data_dir.cpp:617] start to path gc by rowsetid.
I1110 17:11:44.431969 4990 data_dir.cpp:650] finished one time path gc by rowsetid.