from:"Joe Jin"

Re: [vdpa_sim_net] 79991caf52: net/ipv4/ipmr.c:#RCU-list_traversed_in_non-reader_section

2021-02-08 Thread Joe Jin

On 2/7/21 12:15 PM, Dongli Zhang wrote: > Is it possible that the issue is not due to this change? Looks this issue does not related your change, from dmesg output, when issue occurred, virtio was not loaded: [ 502.508450] [ cut here ] [ 502.511859] WARNING: CPU: 0 PID:

Re: [PATCH v2 1/1] vhost scsi: alloc vhost_scsi with kvzalloc() to avoid delay

2021-02-01 Thread Joe Jin

ize to allocate is more than >> PAGE_SZIE and when __GFP_RETRY_MAYFAIL is not explicitly set. >> >> Cc: Aruna Ramakrishna >> Cc: Joe Jin >> Signed-off-by: Dongli Zhang >> --- >> Changed since v1: >> - To combine kzalloc() and vzalloc() as kvzalloc()

Re: [PATCH] xen/swiotlb: correct the check for xen_destroy_contiguous_region

2020-04-28 Thread Joe Jin

On 4/28/20 10:25 AM, Konrad Rzeszutek Wilk wrote: > On Tue, Apr 28, 2020 at 12:19:41PM +0200, Jürgen Groß wrote: >> On 28.04.20 10:25, Peng Fan wrote: > > Adding Joe Jin. > > Joe, didn't you have some ideas on how this could be implemented? > >>>> Subj

Re: [PATCH] tracing: make exported ftrace_set_clr_event non-static

2019-07-07 Thread Joe Jin

Patch looks good to me. Reviewed-by: Joe Jin Thanks, Joe On 7/4/19 10:21 AM, Denis Efremov wrote: > The function ftrace_set_clr_event is declared static and marked > EXPORT_SYMBOL_GPL(), which is at best an odd combination. Because the > function was decided to be a part of API, th

Re: [PATCH v2 1/2] swiotlb: add debugfs to track swiotlb buffer usage

2018-12-10 Thread Joe Jin

On 12/10/18 12:00 PM, Tim Chen wrote: >> @@ -528,6 +538,9 @@ phys_addr_t swiotlb_tbl_map_single(struct device *hwdev, >> dev_warn(hwdev, "swiotlb buffer is full (sz: %zd bytes)\n", >> size); >> return SWIOTLB_MAP_ERROR; >> found: >> +#ifdef CONFIG_DEBUG_FS >> +io_tlb_used +=

Re: [PATCH v2 1/2] swiotlb: add debugfs to track swiotlb buffer usage

2018-12-10 Thread Joe Jin

er usage via debugfs would help the user estimate the size of > swiotlb buffer to pre-allocate or analyze device driver memory leak issue. > > Signed-off-by: Dongli Zhang Reviewed-by: Joe Jin > --- > Changed since v1: > * init debugfs with late_initcall (suggested by Robi

Re: [PATCH v2 2/2] swiotlb: checking whether swiotlb buffer is full with io_tlb_used

2018-12-10 Thread Joe Jin

On 12/9/18 4:37 PM, Dongli Zhang wrote: > This patch uses io_tlb_used to help check whether swiotlb buffer is full. > io_tlb_used is no longer used for only debugfs. It is also used to help > optimize swiotlb_tbl_map_single(). > > Suggested-by: Joe Jin > Signed-off-by: Dongli Z

Re: [PATCH RFC 1/1] swiotlb: add debugfs to track swiotlb buffer usage

2018-12-06 Thread Joe Jin

On 12/6/18 9:49 PM, Dongli Zhang wrote: > > > On 12/07/2018 12:12 AM, Joe Jin wrote: >> Hi Dongli, >> >> Maybe move d_swiotlb_usage declare into swiotlb_create_debugfs(): > > I assume the call of swiotlb_tbl_map_single() might be frequent in some > s

Re: [PATCH RFC 1/1] swiotlb: add debugfs to track swiotlb buffer usage

2018-12-06 Thread Joe Jin

Hi Dongli, Maybe move d_swiotlb_usage declare into swiotlb_create_debugfs(): void swiotlb_create_debugfs(void) { #ifdef CONFIG_DEBUG_FS static struct dentry *d_swiotlb_usage = NULL; if (d_swiotlb_usage) return; d_swiotlb_usage = debugfs_create_dir("swiotl

Re: [PATCH 4.4 010/268] xen-swiotlb: fix the check condition for xen_swiotlb_free_coherent

2018-06-07 Thread Joe Jin

On 6/7/18 1:28 PM, Ben Hutchings wrote: > On Mon, 2018-05-28 at 11:59 +0200, Greg Kroah-Hartman wrote: >> 4.4-stable review patch. If anyone has any objections, please let me know. >> >> ---------- >> >> From: Joe Jin >> >> commit 4855

[PATCH] xen-swiotlb: fix the check condition for xen_swiotlb_free_coherent

2018-05-17 Thread Joe Jin

DMA heap. This issue introduced by commit 6810df88dcfc2 "xen-swiotlb: When doing coherent alloc/dealloc check before swizzling the MFNs.". Signed-off-by: Joe Jin Tested-by: John Sobecki Reviewed-by: Rzeszutek Wilk Cc: sta...@vger.kernel.org --- drivers/xen/swiotlb-xen.c | 2 +- 1 fi

Re: [PATCH UPSTREAM] xen-swiotlb: fix the check condition for xen_swiotlb_free_coherent

2018-05-17 Thread Joe Jin

On 5/17/18 12:10 PM, Greg KH wrote: > On Thu, May 17, 2018 at 11:45:57AM -0700, Joe Jin wrote: >> When run raidconfig from Dom0 we found that the Xen DMA heap is reduced, >> but Dom Heap is increased by the same size. Tracing raidconfig we found >> that the related ioctl()

[PATCH UPSTREAM] xen-swiotlb: fix the check condition for xen_swiotlb_free_coherent

2018-05-17 Thread Joe Jin

DMA heap. This issue introduced by commit 6810df88dcfc2 "xen-swiotlb: When doing coherent alloc/dealloc check before swizzling the MFNs.". Signed-off-by: Joe Jin Tested-by: John Sobecki Reviewed-by: Rzeszutek Wilk Cc: sta...@vger.kernel.org --- drivers/xen/swiotlb-xen.c | 2 +- 1 fi

Re: [PATCH V2] [scsi] enclosure: remove duplicate device before add new

2013-09-24 Thread Joe Jin

Hi James, Can you please help to review the patch and comment it? Thanks, Joe On 09/20/13 08:16, Joe Jin wrote: > When do disk pull/insert test we encountered below: > > WARNING: at fs/sysfs/dir.c:455 sysfs_add_one+0xbc/0xe0() > Hardware name: SUN FIRE X4370 M2 SERVER > sysfs

[PATCH V2] [scsi] enclosure: remove duplicate device before add new

2013-09-19 Thread Joe Jin

remove the possible duplicate entry to avoid the conflict when we add new one. Cc: James Bottomley Signed-off-by: Joe Jin --- drivers/misc/enclosure.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/drivers/misc/enclosure.c b/drivers/misc/enclosure.c index 0e8df41..173974d 100644 --- a/dr

Re: [PATCH] [scsi] enclosure: remove all possible sysfs entries before add device

2013-09-11 Thread Joe Jin

for enclosure_device:HDD10, the index of component is not same then conflicted. BTW, 6:0:27:0 and 7:0:27:0 are same disk. > >> > Cc: James Bottomley >> > Signed-off-by: Joe Jin >> > --- >> > drivers/misc/enclosure.c | 7 +++ >> > 1 file cha

Re: [PATCH] [scsi] enclosure: remove all possible sysfs entries before add device

2013-09-09 Thread Joe Jin

On 09/09/13 21:41, Christoph Hellwig wrote: >> Modules linked in: oracleacfs(P)(U) oracleadvm(P)(U) oracleoks(P)(U) > > Please reproduce without this weird crap loaded. > These modules is filesystem and will not impact enclosure. Thanks, Joe -- To unsubscribe from this list: send the line "u

[PATCH] [scsi] enclosure: remove all possible sysfs entries before add device

2013-09-08 Thread Joe Jin

g our test, multipath used, each LUN has 2 paths. when adding second path enclousure did not check if will adding device's symlink existed or no. Cc: James Bottomley Signed-off-by: Joe Jin --- drivers/misc/enclosure.c | 7 +++ 1 file changed, 7 insertions(+) diff --git a/drivers/misc/enclos

Re: [PATCH] dm: allow error target to replace either bio-based and request-based targets

2013-08-22 Thread Joe Jin

ing either bios or requests. > > Add a request-based (.map_rq) member to the error target_type and train > dm_table_set_type() to prefer the md's established type (request-based > or bio-based). If the md doesn't have an established type default to > making the hybrid

Re: [dm-devel] [PATCH v2] dm ioctl: allow change device target type to error

2013-08-21 Thread Joe Jin

t; for there is not map_rq for error target type. Signed-off-by: Joe Jin --- drivers/md/dm-target.c | 7 +++ 1 file changed, 7 insertions(+) diff --git a/drivers/md/dm-target.c b/drivers/md/dm-target.c index 37ba5db..b690910 100644 --- a/drivers/md/dm-target.c +++ b/drivers/md/dm-target.c @@

Re: [PATCH v2] dm ioctl: allow change device target type to error

2013-08-21 Thread Joe Jin

On 08/21/13 23:06, Mike Snitzer wrote: > On Wed, Aug 21 2013 at 10:48am -0400, > Mikulas Patocka wrote: > >> >> >> On Wed, 21 Aug 2013, Joe Jin wrote: >> >>> commit a5664da "dm ioctl: make bio or request based device type immutable" >>

[PATCH v2] dm ioctl: allow change device target type to error

2013-08-21 Thread Joe Jin

commit a5664da "dm ioctl: make bio or request based device type immutable" prevented "dmsetup wape_table" change the target type to "error". -v2: setup md->queue even target type is "error". Signed-off-by: Joe Jin --- drivers/md/dm-io

[PATCH] dm ioctl: allow change device target type to error

2013-08-21 Thread Joe Jin

commit a5664da "dm ioctl: make bio or request based device type immutable" prevented "dmsetup wape_table" change the target type to "error". Signed-off-by: Joe Jin --- drivers/md/dm-ioctl.c | 6 +- drivers/md/dm-table.c | 12 drivers/md/dm.h

Re: [Xen-devel] [PATCH] xen: initialize xen panic handler for PVHVM

2013-08-18 Thread Joe Jin

On 08/16/13 20:43, Konrad Rzeszutek Wilk wrote: > Could you tell me what has been happening without this patch? Without this patch, Xen would not get pvhvm crash event, any config for on_crash in guest configure file will not be triggered. Thanks, Joe -- To unsubscribe from this list: send the l

Re: kernel panic in skb_copy_bits

2013-07-04 Thread Joe Jin

On 07/01/13 16:11, Ian Campbell wrote: > On Mon, 2013-07-01 at 11:18 +0800, Joe Jin wrote: >>> A workaround is to turn off O_DIRECT use by Xen as that ensures >>> the pages are copied. Xen 4.3 does this by default. >>> >>> I believe fixes for this are

Re: kernel panic in skb_copy_bits

2013-07-01 Thread Joe Jin

On 07/01/13 16:11, Ian Campbell wrote: > On Mon, 2013-07-01 at 11:18 +0800, Joe Jin wrote: >>> A workaround is to turn off O_DIRECT use by Xen as that ensures >>> the pages are copied. Xen 4.3 does this by default. >>> >>> I believe fixes for this are

Re: kernel panic in skb_copy_bits

2013-06-30 Thread Joe Jin

On 06/30/13 17:13, Alex Bligh wrote: > > > --On 28 June 2013 12:17:43 +0800 Joe Jin wrote: > >> Find a similar issue >> http://www.gossamer-threads.com/lists/xen/devel/265611 So copied to Xen >> developer as well. > > I thought this sounded familiar. I ha

Re: kernel panic in skb_copy_bits

2013-06-29 Thread Joe Jin

On 06/29/13 15:20, Eric Dumazet wrote: > On Sat, 2013-06-29 at 07:36 +0800, Joe Jin wrote: >> Hi Eric, >> >> The patch not fix the issue and panic as same as early I posted: >>> BUG: unable to handle kernel paging request at 88006d9e8d48 >>> IP: [] memcp

Re: kernel panic in skb_copy_bits

2013-06-28 Thread Joe Jin

neighbour code, because neigh_destroy() uses > skb_queue_purge(&neigh->arp_queue) without holding neighbour lock, > while other parts of the code assume neighbour rwlock is what > protects arp_queue > > Convert all skb_queue_purge() calls to the __skb_queue_purge() variant > >

Re: kernel panic in skb_copy_bits

2013-06-28 Thread Joe Jin

instead of skb_queue_head_init() > to make clear we do not use arp_queue.lock > > And hold neigh->lock in neigh_destroy() to close the race. > > Reported-by: Joe Jin > Signed-off-by: Eric Dumazet > --- > net/core/neighbour.c | 12 +++- > 1 file changed,

Re: kernel panic in skb_copy_bits

2013-06-27 Thread Joe Jin

Find a similar issue http://www.gossamer-threads.com/lists/xen/devel/265611 So copied to Xen developer as well. On 06/27/13 13:31, Eric Dumazet wrote: > On Thu, 2013-06-27 at 10:58 +0800, Joe Jin wrote: >> Hi, >> >> When we do fail over test with iscsi + multipath by reset t

Re: kernel panic in skb_copy_bits

2013-06-27 Thread Joe Jin

Hi Eric, Thanks for you response, will test it and get back to you. Regards, Joe On 06/27/13 13:31, Eric Dumazet wrote: > On Thu, 2013-06-27 at 10:58 +0800, Joe Jin wrote: >> Hi, >> >> When we do fail over test with iscsi + multipath by reset the switches >> on O

kernel panic in skb_copy_bits

2013-06-26 Thread Joe Jin

Hi, When we do fail over test with iscsi + multipath by reset the switches on OVM(2.6.39) we hit the panic: BUG: unable to handle kernel paging request at 88006d9e8d48 IP: [] memcpy+0xb/0x120 PGD 1798067 PUD 1fd2067 PMD 213f067 PTE 0 Oops: [#1] SMP CPU 7 Modules linked in: dm_nfs tun n

Re: [PATCH] ACPI: update user_policy.max when _PPC updated

2013-06-06 Thread Joe Jin

On 06/07/13 03:54, Rafael J. Wysocki wrote: > Do you mean you set a limit in the BIOS setup and the kernel changed that > limit > on boot? Sorry for the confusing. The issue is when we disable hardcap before kernel boot up, after kernel bring up, any changes of _PPC will update scaling_max_freq

Re: [PATCH] ACPI: update user_policy.max when _PPC updated

2013-06-06 Thread Joe Jin

On 06/06/13 19:06, Rafael J. Wysocki wrote: > On Thursday, June 06, 2013 08:27:08 AM Joe Jin wrote: >> On 06/06/13 04:40, Rafael J. Wysocki wrote: >>> On Wednesday, June 05, 2013 08:52:52 AM Joe Jin wrote: >>>> When _PPC changed dynamically the user_policy.max

Re: [PATCH] ACPI: update user_policy.max when _PPC updated

2013-06-05 Thread Joe Jin

On 06/06/13 04:40, Rafael J. Wysocki wrote: > On Wednesday, June 05, 2013 08:52:52 AM Joe Jin wrote: >> When _PPC changed dynamically the user_policy.max will not be updated, >> this prevent CPU run on the highest frequency. > > Why should the user setting be always related t

[PATCH] ACPI: update user_policy.max when _PPC updated

2013-06-04 Thread Joe Jin

When _PPC changed dynamically the user_policy.max will not be updated, this prevent CPU run on the highest frequency. Signed-off-by: Joe Jin Cc: Rafael J. Wysocki Cc: Viresh Kumar --- drivers/acpi/processor_perflib.c | 17 - 1 file changed, 16 insertions(+), 1 deletion

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin

Hi Yijing, Thanks for your reference, the patch looks good for me, but I have no chance to test it on customer's env. Best Regards, Joe On 12/19/12 13:52, Yijing Wang wrote: > On 2012/12/19 11:04, Joe Jin wrote: >> Hi all, >> >> I backported mps commits

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-12-18 Thread Joe Jin

AD) > Intel Corporation > todd.fujin...@intel.com > (503) 712-4565 > > > -Original Message- > From: Ethan Zhao [mailto:ethan.ker...@gmail.com] > Sent: Wednesday, November 28, 2012 7:10 PM > To: Fujinaka, Todd > Cc: Joe Jin; Ben Hutchings; Mary Mcgrath;

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-28 Thread Joe Jin

On 11/28/12 02:10, Ben Hutchings wrote: > On Tue, 2012-11-27 at 17:32 +, Fujinaka, Todd wrote: >> Forgive me if I'm being too repetitious as I think some of this has >> been mentioned in the past. >> >> We (and by we I mean the Ethernet part and driver) can only change the >> advertised availab

Re: [E1000-devel] 82571EB: Detected Hardware Unit Hang

2012-11-26 Thread Joe Jin

On 11/27/12 00:23, Fujinaka, Todd wrote: > If you look at the previous section, DevCap, you'll see that it's > correctly advertising 256 bytes but the system is negotiating 128 for > the link to the Ethernet controller. Things on the "other" side of the > link are controlled outside of the e1000 dr

Re: 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin

On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. Hi Tushar, Seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset

Re: 82571EB: Detected Hardware Unit Hang

2012-11-20 Thread Joe Jin

On 11/20/12 16:59, Dave, Tushar N wrote: > Have you power off the system completely after modifying eeprom? If not > please do so. seems not works for me, would you please help to check what is wrong of my operations? Original eeprom dump: # ethtool -e eth3 | head -8 Offset Values ---

Re: 82571EB: Detected Hardware Unit Hang

2012-11-18 Thread Joe Jin

On 11/16/12 04:26, Dave, Tushar N wrote: >> Would you please help to fine the offset of max payload size in eeprom? >> I'd like to have a try to modify it by ethtool. > > It is defined using bit 8 of word 0x1A. > Bit value 0 = 128B , bit value 1 = 256B Hi Tushar, I checked one of my server which

Re: 82571EB: Detected Hardware Unit Hang

2012-11-14 Thread Joe Jin

On 11/14/12 11:45, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, November 13, 2012 6:48 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vger

Re: 82571EB: Detected Hardware Unit Hang

2012-11-13 Thread Joe Jin

On 11/09/12 04:35, Dave, Tushar N wrote: > All devices in path from root complex to 82571, should have *same* max > payload size otherwise it can cause hang. > Can you double check this? Hi Tushar, Checked with hardware vendor and they said no way to modify the max payload size from BIOS, can

Re: 82571EB: Detected Hardware Unit Hang

2012-11-08 Thread Joe Jin

again, Joe > On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: >> > I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when >> > doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, >> > just copy a big file (>500M) from ano

82571EB: Detected Hardware Unit Hang

2012-11-07 Thread Joe Jin

Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR- FastB2B- DisINTx- - Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- TAbort- SERR- http://www.oracle.com> Joe Jin | Software Development Senior Manager | +8610.6106.5624 ORACLE | Linux and Virtualization No.

[PATCH] qla3xxx: Ensure request/response queue addr writes to the registers

2012-10-21 Thread Joe Jin

Before use the request and response queue addr, make sure it has wrote to the registers. Signed-off-by: Joe Jin Cc: Jitendra Kalsaria Cc: Ron Mercer --- drivers/net/ethernet/qlogic/qla3xxx.c | 9 +++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/drivers/net/ethernet

[PATCH] qla3xxx: Ensure request/response queue addr writes to the registers

2012-10-18 Thread Joe Jin

Before use the request and response queue addr, make sure it has wrote to the registers. Signed-off-by: Joe Jin Cc: Jitendra Kalsaria Cc: Ron Mercer --- drivers/net/ethernet/qlogic/qla3xxx.c | 6 ++ 1 file changed, 6 insertions(+) diff --git a/drivers/net/ethernet/qlogic/qla3xxx.c b

[PATCH] qla3xxx: Ensure request/response queue addr writes to the registers

2012-10-18 Thread Joe Jin

Before use the request and response queue addr, make sure it has wrote to the registers. Signed-off-by: Joe Jin Cc: Jitendra Kalsaria Cc: Ron Mercer --- drivers/net/ethernet/qlogic/qla3xxx.c | 6 ++ 1 file changed, 6 insertions(+) diff --git a/drivers/net/ethernet/qlogic/qla3xxx.c b

Re: [PATCH] qla3xxx: Ensure req_q_phy_addr writes to the register

2012-10-17 Thread Joe Jin

On 10/18/12 01:45, Jitendra Kalsaria wrote: > > >> -Original Message----- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, October 16, 2012 11:32 PM >> To: Ron Mercer; Jitendra Kalsaria; Dept-Eng Linux Driver >> Cc: netdev; linux-ker

[PATCH] qla3xxx: Ensure req_q_phy_addr writes to the register

2012-10-16 Thread Joe Jin

Make sure req_q_phy_addr write to the register. Signed-off-by: Joe Jin Cc: Ron Mercer Cc: Jitendra Kalsaria --- drivers/net/ethernet/qlogic/qla3xxx.c | 6 ++ 1 file changed, 6 insertions(+) diff --git a/drivers/net/ethernet/qlogic/qla3xxx.c b/drivers/net/ethernet/qlogic/qla3xxx.c index

Re: 82571EB: Detected Hardware Unit Hang

2012-07-14 Thread Joe Jin

On 07/15/12 11:42, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Thursday, July 12, 2012 9:34 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vge

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/12/12 13:57, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 8:13 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vge

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/12/12 11:07, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 7:58 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vge

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/12/12 10:52, Dave, Tushar N wrote: > What is the exact error messages in BIOS log? Error message from BIOS event log: 07/12/12 05:54:00 PCI Express Non-Fatal Error Thanks, Joe -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord..

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/12/12 02:51, Dave, Tushar N wrote: > > Joe, > > I see couple of errors in lspci output. > Device capability status register shows UnCorrectable PCIe error. This means > there is certainly something went wrong. The only way to recover from > Uncorrectable errors is reset. > > Dev

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/11/12 15:50, Dave, Tushar N wrote: > Device status and AER sections show some errors that looks little suspicious > to me but I'm not too sure. I will get back tomorrow. > Thanks a lot, Tushar! Joe -- Oracle <http://www.oracle.com> Joe Jin | Software Develo

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/11/12 15:37, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Wednesday, July 11, 2012 12:18 AM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vge

Re: 82571EB: Detected Hardware Unit Hang

2012-07-11 Thread Joe Jin

On 07/11/12 15:11, Dave, Tushar N wrote: >> -Original Message- >> From: Joe Jin [mailto:joe@oracle.com] >> Sent: Tuesday, July 10, 2012 10:03 PM >> To: Dave, Tushar N >> Cc: e1000-de...@lists.sf.net; net...@vger.kernel.org; linux- >> ker...@vge

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin

On 07/11/12 12:05, Dave, Tushar N wrote: > When you said you had this issue with RHEL5 and RHEL6 drivers, have you > install RHEl5/6 kernel and reproduced it? If so I think I should install > RHEL6 and try reproduce it locally! > Yes I reproduced this on both RHEL5 and RHEL6. So far I tried to

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin

On 07/11/12 11:22, Dave, Tushar N wrote: > Thanks for info. I see that hang occurs right when HW processing first TX > descriptor with TSO. > Would you be able to reproduce issue with TSO off? Disable TSO by 'ethtool > -K ethx tso off' > Let all debug enabled as it is, that will help us debug f

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin

On 07/11/12 03:02, Dave, Tushar N wrote: >> -Original Message- >> From: netdev-ow...@vger.kernel.org [mailto:netdev-ow...@vger.kernel.org] >> On Behalf Of Joe Jin >> Sent: Tuesday, July 10, 2012 12:40 AM >> To: Joe Jin >> Cc: e1000-de...@lists.sf.net; n

Re: 82571EB: Detected Hardware Unit Hang

2012-07-10 Thread Joe Jin

count)) { <--- snip ---> 1487 } I checked all driver codes I did not found anywhere will set the upper.data with E1000_TXD_STAT_DD, I guess upper.data be set by hardware? If OS is 32bit system, what which happen? Thanks in advance, Joe On 07/09/12 16:51, Joe Jin wrote: > Hi

Re: 82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin

On 07/09/12 17:21, Eric Dumazet wrote: > On Mon, 2012-07-09 at 16:51 +0800, Joe Jin wrote: >> Hi list, >> >> I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing >> scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy &g

82571EB: Detected Hardware Unit Hang

2012-07-09 Thread Joe Jin

Hi list, I'm seeing a Unit Hang even with the latest e1000e driver 2.0.0 when doing scp test. this issue is easy do reproduced on SUN FIRE X2270 M2, just copy a big file (>500M) from another server will hit it at once. Would you please help on this? device info: # lspci -s 05:00.0 05:00.0 Ethe

Re: [PATCH] add check do_direct_IO() return val

2007-07-31 Thread Joe Jin

> Does a patch like this work? I don't have any test-cases, but it would be > good to have something like this tested and passed back with proper > explanations and sign-offs. Yes it work find after apply the patch, thanks. Joe - To unsubscribe from this list: send the line "unsubscribe linux-k

Re: [PATCH] add check do_direct_IO() return val

2007-07-31 Thread Joe Jin

> Okay. Here is the fix for the problem. It work fine, thanks. > Signed-off-by: Badari Pulavarty <[EMAIL PROTECTED]> Acked-by: Joe Jin <[EMAIL PROTECTED]> - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMA

Re: [PATCH] add check do_direct_IO() return val

2007-07-30 Thread Joe Jin

> Hmm.. in this config file, whats causing DIO to panic ? Which test actually > passing faulty buffer ? > By my testing, just defined job3 and job10 will also get the panic, but if only have one of them, panic will not appear. the faulty buffer maybe passed by mmap. Thanks, Joe - To unsubscribe

Re: [PATCH] add check do_direct_IO() return val

2007-07-30 Thread Joe Jin

> Well, I'm having a heck of a time getting this to fail. It looks > possible, though. Joe, were you guys able to narrow it down to a > reproducible test case? Do you have any oops output messages from > the crashes? Zach, it easy to reproduce through fio with following config file # cat

Re: [PATCH] add check do_direct_IO() return val

2007-07-27 Thread Joe Jin

> I tested Andrew's patch and panic was gone but got few ENOTBLK. > So I tried with Joe's patch , both panic and ENOTBLK are gone now. > But in Joe's patch if (ret == -ENOTBLK && (rw & WRITE)), dio_cleanup(dio) > was not getting called because of break. So I moved dio_cleanup just > after if (ret)

Re: [PATCH] add check do_direct_IO() return val

2007-07-27 Thread Joe Jin

> If you are sure the problem is caused by EFAULT or ENOMEM , why not: To this issue(wengang have posted the background), I hit it for do_direct_IO() return EFAULT caused the panic, however, the function not only return these critical errors, maybe other error return like EIO. Thanks, Joe - To u

Re: [PATCH] add check do_direct_IO() return val

2007-07-27 Thread Joe Jin

> I think we still want to run dio_cleanup() if do_direct_IO() failed? > Otherwise we can leak pages. > > And there's nothing special about EFAULT or ENOMEM here: if do_direct_IO() > returns any error then that's it: we bale out, yes? > Yes, I think we'll out from here if get EFAULT/ENOMEM erro

[PATCH] add check do_direct_IO() return val

2007-07-26 Thread Joe Jin

ers, at this point, kernel will panic. Signed-off-by: Joe Jin <[EMAIL PROTECTED]> --- --- linux-2.6.22/fs/direct-io.c.orig2007-07-26 11:32:27.0 +0800 +++ linux-2.6.22/fs/direct-io.c 2007-07-26 11:33:58.0 +0800 @@ -1031,7 +1031,9 @@ direct_io_worker(int rw, struct

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-18 Thread Joe Jin

Sorry for I use a unclean kernel tree, the panic have gone ;) also, I will testing it again with other server, if I hit it I'll let you know. Thanks, Joe - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-18 Thread Joe Jin

> > > > That is... surprising to me. > > To me also. I'd like to double-check the code which Joe actually > tested, please - have a sneaking suspicion that the 2.6.22 version > of that function was tested ;) Maybe I make a fault? I'll testing it again then give you a report. Thank, Joe - To un

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-17 Thread Joe Jin

On 2007-07-18 05:49, Hugh Dickins wrote: > On Wed, 18 Jul 2007, Joe Jin wrote: > > > > With your patch, I have reproduced the panic: > > That is... surprising to me. (I hadn't been able to reproduce it with > or without the patches: maybe I just need to try har

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-17 Thread Joe Jin

With your patch, I have reproduced the panic: Unable to handle kernel paging request at 186a RIP: [] __alloc_pages+0x2f/0x2c3 PGD 72595067 PUD 72594067 PMD 0 Oops: [1] SMP CPU 0 Modules linked in: xt_tcpudp iptable_filter ip_tables x_tables cpufreq_ondemand dm_mirror dm_mul

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-13 Thread Joe Jin

> > Patch gone too ;) I deleted it. I was hoping that you'd send me the final > finished product (please). > Ha.., the patch against 2.6.22, at your patch have use htlb_alloc_mask, but I cannot found it at 2.6.22 kernel tree, I think you must use difference kernel tree :) Thanks, Joe --- linu

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-13 Thread Joe Jin

> > if (nid > MAX_NUMNODES) then that is a bug and we should report it (doing > this via a BUG() is OK) rather than quietly covering it up. I have create a patch to check if nid > MAX_NUMNODES, please apply it thanks Signed-off-by: Joe Jin <[EMAIL PROTECTED]> --- --- linux

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-13 Thread Joe Jin

> > > > The patch looks good for this bug, thanks :) > > If you have time could you test it and sent it back at me please? > Test passed and panic gone. thanks. > > if other caller give a invalid nid to alloc_pages_node(), __alloc_pages > > will crash again. > > That would be a buggy caller,

Re: [PATCH] Add nid sanity on alloc_pages_node

2007-07-13 Thread Joe Jin

On 2007-07-12 22:18, Andrew Morton wrote: > On Fri, 13 Jul 2007 10:45:07 +0800 Joe Jin <[EMAIL PROTECTED]> wrote: > > > This patch add nid sanity check on alloc_pages_node(). > > While two process change nr_hugepages at a system, alloc_fresh_huge_page() > > been

[PATCH] Add nid sanity on alloc_pages_node

2007-07-12 Thread Joe Jin

epages done Run the script at _two_ difference terminal, after a short time, a kernel panic info will print. Signed-off-by: Joe Jin <[EMAIL PROTECTED]> --- --- linux-2.6.22/include/linux/gfp.h.orig 2007-07-12 15:06:23.0 +0800 +++ linux-2.6.22/include/linux/gfp.h2007-07-12

Re: [PATCH] [scsi]: Add offline state checking while dispatch a scsi cmd

2007-03-11 Thread Joe Jin

> The 2.6.9 base is very old in mainline terms. Are you sure the bug hasn't > been fixed in mainline by other means? I cannot confirm if it have fixed in latest kernel, the server is a production system, it's hard to debug it and try reproduce. - To unsubscribe from this list: send the line "uns

Re: [PATCH] [scsi]: Add offline state checking while dispatch a scsi cmd

2007-03-11 Thread Joe Jin

> > This is a bug actually in the megaraid. Aha, I'll track it. > > And this is a direct command submission path: it already passed both > online check gates in this path *after* the device was offlined, so > adding a third won't fix this. Yeah, I have notice that, however, from the logs, th

Re: [PATCH] [scsi]: Add offline state checking while dispatch a scsi cmd

2007-03-08 Thread Joe Jin

> What's the error you're trying to fix? scsi_dispatch_cmd() is only > called from scsi_request_fn() which already has an equivalent of this > check in it just prior to calling dispatch. Yeah, I have saw the cheking at scsi_request_fn(), recently we got a crash info as following at rhel4 2.6.9-42

[PATCH] [scsi]: Add offline state checking while dispatch a scsi cmd

2007-03-08 Thread Joe Jin

While a scsi device hw error occured, device's status maybe setting to SDEV_OFFLINE, So at scsi_dispatch_cmd function, we should checking if device have offline, if yes, do nothing and just return error to user directly. Signed-off-by: Joe Jin <[EMAIL PROTECTED]> -- --- linux-2.6.21-

[PATCH] [ipv6]: adjust inet6_exit() cleanup sequence against inet6_init()

2007-02-09 Thread Joe Jin

() called first, then call addrconf_init(). Signed-off-by: Joe Jin <[EMAIL PROTECTED]> --- linux-2.6.20/net/ipv6/af_inet6.c.orig 2007-02-09 14:05:10.0 +0800 +++ linux-2.6.20/net/ipv6/af_inet6.c2007-02-09 14:21:54.0 +0800 @@ -930,25 +930,28 @@ { /* First

Why not set siginfo at do_general_protection()?

2007-01-18 Thread Joe Jin

Hi, I have a question, while a general protection occured, do_general_protection() should be called, to other faults, it always set siginfo struct, but at this function, it just send a SIGSEGV by force_sig() call but not create a siginfo struct, at send_signal(), it check the siginfo's value and

[PATCH] slip: Replace kmalloc() + memset() pairs with the appropriate kzalloc() calls

2007-01-15 Thread joe jin

This patch replace kmalloc() + memset() pairs with the appropriate kzalloc(). Signed-off-by: Joe Jin <[EMAIL PROTECTED]> --- drivers/net/slip.c.orig 2007-01-16 14:21:52.0 +0800 +++ drivers/net/slip.c 2007-01-16 14:23:07.0 +0800 @@ -1343,15 +1343,12 @@

Re: [PATCH] bonding: Replace kmalloc() + memset() pairs with the appropriate kzalloc() calls

2007-01-14 Thread joe jin

> Hi Joe, > > On 1/12/07, joe jin <[EMAIL PROTECTED]> wrote: > > @@ -788,7 +786,7 @@ static int rlb_initialize(struct bonding > > > > spin_lock_init(&(bond_info->rx_hashtbl_lock)); > > > > - new_hashtbl = kmalloc(size, GFP_KERNEL)

[PATCH] bonding: Replace kmalloc() + memset() pairs with the appropriate kzalloc() calls

2007-01-11 Thread joe jin

This patch replace kmalloc() + memset() pairs with the appropriate kzalloc() calls. Signed-off-by: Joe Jin <[EMAIL PROTECTED]> -- diff -urNp old/drivers/net/bonding/bond_alb.c new/drivers/net/bonding/bond_alb.c --- old/drivers/net/bonding/bond_alb.c 2006-11-30 05:57:37.0 +0800 +

Re: BUG: sleeping function called from invalid context at kernel/rwsem.c:20

2007-01-08 Thread Joe Jin

On 1/6/07, Adrian Bunk <[EMAIL PROTECTED]> wrote: On Thu, Dec 21, 2006 at 11:07:27AM +0800, Joe Jin wrote: Thanks for your report. Is this issue still present in kernel 2.6.20-rc3? And was it already present in kernel 2.6.19? It is not occur again, just appeared this time. At 2.6.19

94 matches

Mail list logo