We are getting the following on our RHEL6 cluster using openmpi 1.8.1 with meep http://ab-initio.mit.edu/wiki/index.php/Meep
WARNING: at fs/hugetlbfs/inode.c:940 hugetlb_file_setup+0x227/0x250() (Tainted: P --------------- ) Hardware name: C6100 Using mlock ulimits for SHM_HUGETLB deprecated Modules linked in: rdma_ucm(U) openafs(P)(U) autofs4 mgc(U) lustre(U) lov(U) mdc(U) lquota(U) osc(U) ksocklnd(U) ko2iblnd(U) rdma_cm(U) iw_cm(U) ib_addr(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) nfs lockd fscache auth_rpcgss nfs_acl sunrpc acpi_cpufreq freq_table mperf ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack xt_multiport iptable_filter ip_tables ip6_tables ib_ipoib(U) ib_cm(U) ipv6 ib_uverbs(U) ib_umad(U) iw_nes(U) libcrc32c cxgb3 mdio mlx4_vnic(U) mlx4_vnic_helper(U) ib_sa(U) mlx4_ib(U) mlx4_en(U) mlx4_core(U) ib_mthca(U) ib_mad(U) ib_core(U) mic(U) vhost_net macvtap macvlan tun kvm ipmi_devintf igb ptp pps_core dcdbas microcode i2c_i801 i2c_core sg iTCO_wdt iTCO_vendor_support ioatdma dca i7core_edac edac_core shpchp ext4 jbd2 mbcache sd_mod crc_t10dif ahci dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan] Pid: 14367, comm: meep-mpi Tainted: P --------------- 2.6.32-358.23.2.el6.x86_64 #1 Call Trace: [<ffffffff8106e3e7>] ? warn_slowpath_common+0x87/0xc0 [<ffffffff8106e4d6>] ? warn_slowpath_fmt+0x46/0x50 [<ffffffff8114615c>] ? user_shm_lock+0x9c/0xc0 [<ffffffff811ffbd7>] ? hugetlb_file_setup+0x227/0x250 [<ffffffff81281720>] ? sprintf+0x40/0x50 [<ffffffff8120e112>] ? newseg+0x152/0x290 [<ffffffff81208f51>] ? ipcget+0x61/0x200 [<ffffffff8114765e>] ? remove_vma+0x6e/0x90 [<ffffffff8120dfa9>] ? sys_shmget+0x59/0x60 [<ffffffff8120dfc0>] ? newseg+0x0/0x290 [<ffffffff8120dfb0>] ? shm_security+0x0/0x10 [<ffffffff8120d510>] ? shm_more_checks+0x0/0x20 [<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b ---[ end trace 375c130ede6f14a0 ]--- Doing some googling looks like this could be hurting our performance, but i'm not sure what todo about it? There is nothing on the list, but there was one reference to another MPI library. Is there any idea what would cause this? Brock Palen www.umich.edu/~brockp CAEN Advanced Computing XSEDE Campus Champion bro...@umich.edu (734)936-1985
signature.asc
Description: Message signed with OpenPGP using GPGMail