So it seems you have an old OFED without the log_num_mtt parameter.
Can you install the latest Mellanox OFED, or check which community OFED release has it?
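
In case it helps with comparing OFED releases, here is a rough sketch (not an official tool) that lists the parameter names found on the "parm:" lines of modinfo output, so you can check for log_num_mtt without reading the whole dump. The function name module_params and the short sample text are made up for illustration; the sample only mimics the layout of the output quoted below.

```python
import re


def module_params(modinfo_text):
    """Return the set of parameter names found on 'parm:' lines."""
    names = set()
    for line in modinfo_text.splitlines():
        # Allow optional mail-quote prefixes ("> ") before the field name,
        # so text pasted from a quoted email also works.
        match = re.match(r"^[>\s]*parm:\s+(\w+):", line)
        if match:
            names.add(match.group(1))
    return names


# Hypothetical sample in the same layout as "modinfo mlx4_core" output.
sample = """\
parm:           debug_level:Enable debug tracing if > 0 (int)
parm:           log_mtts_per_seg:Log2 number of MTT entries per segment
(1-7) (int)
"""

params = module_params(sample)
print("log_num_mtt present:", "log_num_mtt" in params)
print("log_mtts_per_seg present:", "log_mtts_per_seg" in params)
```

On a real system you could feed it the text from subprocess.run(["modinfo", "mlx4_core"], capture_output=True, text=True).stdout instead of the sample string.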


On Tue, Aug 19, 2014 at 9:34 AM, Rio Yokota <rioyok...@mac.com> wrote:

> Here is what "modinfo mlx4_core" gives
>
> filename:       /lib/modules/3.13.0-34-generic/kernel/drivers/net/ethernet/mellanox/mlx4/mlx4_core.ko
> version:        2.2-1
> license:        Dual BSD/GPL
> description:    Mellanox ConnectX HCA low-level driver
> author:         Roland Dreier
> srcversion:     3AE29A0A6538EBBE9227361
> alias:          pci:v000015B3d00001010sv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Fsv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Esv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Dsv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Csv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Bsv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000100Asv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001009sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001008sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001007sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001006sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001005sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001004sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001003sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00001002sv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000676Esv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006746sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006764sv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000675Asv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006372sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006750sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006368sv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000673Csv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006732sv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006354sv*sd*bc*sc*i*
> alias:          pci:v000015B3d0000634Asv*sd*bc*sc*i*
> alias:          pci:v000015B3d00006340sv*sd*bc*sc*i*
> depends:
> intree:         Y
> vermagic:       3.13.0-34-generic SMP mod_unload modversions
> signer:         Magrathea: Glacier signing key
> sig_key:        50:0B:C5:C8:7D:4B:11:5C:F3:C1:50:4F:7A:92:E2:33:C6:14:3D:58
> sig_hashalgo:   sha512
> parm:           debug_level:Enable debug tracing if > 0 (int)
> parm:           msi_x:attempt to use MSI-X if nonzero (int)
> parm:           num_vfs:enable #num_vfs functions if num_vfs > 0
> num_vfs=port1,port2,port1+2 (array of byte)
> parm:           probe_vf:number of vfs to probe by pf driver (num_vfs > 0)
> probe_vf=port1,port2,port1+2 (array of byte)
> parm:           log_num_mgm_entry_size:log mgm size, that defines the num
> of qp per mcg, for example: 10 gives 248.range: 7 <= log_num_mgm_entry_size
> <= 12. To activate device managed flow steering when available, set to -1
> (int)
> parm:           enable_64b_cqe_eqe:Enable 64 byte CQEs/EQEs when the FW
> supports this (default: True) (bool)
> parm:           log_num_mac:Log2 max number of MACs per ETH port (1-7)
> (int)
> parm:           log_num_vlan:Log2 max number of VLANs per ETH port (0-7)
> (int)
> parm:           use_prio:Enable steering by VLAN priority on ETH ports
> (0/1, default 0) (bool)
> parm:           log_mtts_per_seg:Log2 number of MTT entries per segment
> (1-7) (int)
> parm:           port_type_array:Array of port types: HW_DEFAULT (0) is
> default 1 for IB, 2 for Ethernet (array of int)
> parm:           enable_qos:Enable Quality of Service support in the HCA
> (default: off) (bool)
> parm:           internal_err_reset:Reset device on internal errors if
> non-zero (default 1, in SRIOV mode default is 0) (int)
>
> Most likely you have installed an old OFED that does not have this parameter.
>
> Try:
>
> # modinfo mlx4_core
>
> and see if it is there.
> I would suggest installing the latest OFED or Mellanox OFED.
>
>
> On Mon, Aug 18, 2014 at 9:53 PM, Rio Yokota <rioyok...@mac.com> wrote:
>
>> I get "ofed_info: command not found". Note that I did not install the
>> entire OFED; I installed individual components with "apt-get install
>> infiniband-diags ibutils ibverbs-utils libmlx4-dev" for the drivers and
>> utilities.
>>
>> Hi,
>> Which OFED version do you use?
>> (ofed_info -s)
>>
>>
>> On Sun, Aug 17, 2014 at 7:16 PM, Rio Yokota <rioyok...@mac.com> wrote:
>>
>>> I have recently upgraded from Ubuntu 12.04 to 14.04 and OpenMPI gives
>>> the following warning upon execution, which did not appear before the
>>> upgrade.
>>>
>>> WARNING: It appears that your OpenFabrics subsystem is configured to only
>>> allow registering part of your physical memory. This can cause MPI jobs to
>>> run with erratic performance, hang, and/or crash.
>>>
>>> Everything that I could find on Google suggests changing log_num_mtt,
>>> but I cannot do this for the following reasons:
>>> 1. There is no log_num_mtt in /sys/module/mlx4_core/parameters/
>>> 2. Adding "options mlx4_core log_num_mtt=24" to
>>> /etc/modprobe.d/mlx4.conf doesn't seem to change anything
>>> 3. I am not sure how I can restart the driver because there is no
>>> "/etc/init.d/openibd" file (I've rebooted the system but it didn't do
>>> anything to create log_num_mtt)
>>>
>>> [Template information]
>>> 1. OpenFabrics is from the Ubuntu distribution using "apt-get install
>>> infiniband-diags ibutils ibverbs-utils libmlx4-dev"
>>> 2. OS is Ubuntu 14.04 LTS
>>> 3. Subnet manager is from the Ubuntu distribution using "apt-get install
>>> opensm"
>>> 4. Output of ibv_devinfo is:
>>> hca_id: mlx4_0
>>>         transport:                      InfiniBand (0)
>>>         fw_ver:                         2.10.600
>>>         node_guid:                      0002:c903:003d:52b0
>>>         sys_image_guid:                 0002:c903:003d:52b3
>>>         vendor_id:                      0x02c9
>>>         vendor_part_id:                 4099
>>>         hw_ver:                         0x0
>>>         board_id:                       MT_1100120019
>>>         phys_port_cnt:                  1
>>>                 port:   1
>>>                         state:                  PORT_ACTIVE (4)
>>>                         max_mtu:                4096 (5)
>>>                         active_mtu:             4096 (5)
>>>                         sm_lid:                 1
>>>                         port_lid:               1
>>>                         port_lmc:               0x00
>>>                         link_layer:             InfiniBand
>>> 5. Output of ifconfig for IB is
>>> ib0       Link encap:UNSPEC  HWaddr 80-00-00-48-FE-80-00-00-00-00-00-00-00-00-00-00
>>>           inet addr:192.168.1.1  Bcast:192.168.1.255  Mask:255.255.255.0
>>>           inet6 addr: fe80::202:c903:3d:52b1/64 Scope:Link
>>>           UP BROADCAST RUNNING MULTICAST  MTU:2044  Metric:1
>>>           RX packets:26 errors:0 dropped:0 overruns:0 frame:0
>>>           TX packets:34 errors:0 dropped:16 overruns:0 carrier:0
>>>           collisions:0 txqueuelen:256
>>>           RX bytes:5843 (5.8 KB)  TX bytes:4324 (4.3 KB)
>>> 6. ulimit -l is "unlimited"
>>>
>>> Thanks,
>>> Rio
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2014/08/25048.php
>>>
