[LEDE-DEV] [PATCH v2] ramips: add support for Ubiquiti EdgeRouter X-SFP

p.wassi at gmx.at p.wassi at gmx.at
Mon Jun 12 11:14:42 PDT 2017


Ok - I've got some interesting news.
First, the stalls started to appear on multiple CPUs/tasks simultaneously:
(find an excerpt from dmesg under [1]).

For further testing, I've disabled SQM, rebooted and whoaa...
These kernel errors are gone. Running fine since 13 hours.
So it seems that it's not an issue with the Edgerouters but with SQM.

My SQM configuration was basically just using cake + piece_of_cake.qos,
but that's clearly off topic for now. (I'm also CC'ing this mail to Toke,
the maintainer of sqm-scripts).

Regards,
P. Wassi

[1]:
> [  260.610000] Task dump for CPU 2:
> [  334.230000] Task dump for CPU 2:
> [  399.340000] Task dump for CPU 2:
> [  579.390000] Task dump for CPU 2:
> [13074.720000] Task dump for CPU 2:
> [13074.850000] Task dump for CPU 3:
> [30220.460000] Task dump for CPU 2:
> [30220.590000] Task dump for CPU 3:
> [52142.070000] Task dump for CPU 2:
> [52142.200000] Task dump for CPU 3:
> [59972.980000] Task dump for CPU 2:
> [59973.110000] Task dump for CPU 3:
> [70239.020000] Task dump for CPU 2:
> [70239.150000] Task dump for CPU 3:
> [93181.850000] Task dump for CPU 2:
> [93181.980000] Task dump for CPU 3:
> [113636.630000] Task dump for CPU 2:
> [113636.760000] Task dump for CPU 3:
> [136534.460000] Task dump for CPU 2:
> [136534.590000] Task dump for CPU 3:
> [156163.230000] Task dump for CPU 2:
> [156163.360000] Task dump for CPU 3:
> [173499.280000] Task dump for CPU 2:
> [173499.410000] Task dump for CPU 3:
> [173873.730000] Task dump for CPU 2:
> [173873.860000] Task dump for CPU 3:
> [181271.620000] Task dump for CPU 0:
> [181271.620000] Task dump for CPU 2:
> [181271.620000] Task dump for CPU 3:
> [181271.620000] Task dump for CPU 0:
> [189143.290000] Task dump for CPU 2:
> [189143.420000] Task dump for CPU 3:
> [207190.150000] Task dump for CPU 2:
> [207190.280000] Task dump for CPU 3:


----------------------

> 
> On 09/06/17 08:48, p.wassi at gmx.at wrote:
> > Hi guys,
> >
> > I may be hijacking this specific thread, but as 'testing' was mentioned here...
> > I'm running an Edgerouter X since yesterday (not the -SPF version!) on LEDE r4356
> > and am getting these kernel errors/warnings every five minutes or so:
> 
> Hi,
> 
> can you try with v4.9 please ?
> 
>      John
> 
> >> [  470.410000] INFO: rcu_sched detected stalls on CPUs/tasks:
> >> [  470.420000] 	1-...: (127 GPs behind) idle=ee6/0/0 softirq=9641/9643 fqs=1
> >> [  470.430000] 	(detected by 3, t=6004 jiffies, g=898, c=897, q=606)
> >> [  470.440000] Task dump for CPU 1:
> >> [  470.450000] swapper/1       R running      0     0      1 0x00100000
> >> [  470.460000] Stack : 00000000 0e406087 0000006d ffffffff 00000061 7795a2c0 804df2a4 80490000
> >> [  470.460000] 	  8048c75c 00000001 00000001 8048c540 8048c724 80490000 00000000 800135e4
> >> [  470.460000] 	  1100fc03 00000003 8fc70000 8fc71ec0 80490000 8005ecc8 1100fc03 00000001
> >> [  470.460000] 	  00000000 80490000 804df2a4 8005ecc0 80490000 8001b1a8 1100fc03 00000000
> >> [  470.460000] 	  00000004 8048c4a0 000000a0 8001b1b0 b6eeeefd a02eacbf 5d3cafc3 7dbbdccc
> >> [  470.460000] 	  ...
> >> [  470.530000] Call Trace:
> >> [  470.540000] [<8000be98>] __schedule+0x574/0x758
> >> [  470.550000] [<800135e4>] r4k_wait_irqoff+0x0/0x20
> >> [  470.550000]
> >> [  470.560000] rcu_sched kthread starved for 6016 jiffies! g898 c897 f0x0 s3 ->state=0x1
> > Kernel is 4.4.71
> >
> > Also, just while editing a config file the router rebooted.
> > Does someone else also have this issue?
> >
> > Best regards,
> > P. Wassi
> >
> >
> >> On 07/06/17 12:10, John Crispin wrote:
> >>
> >>
> >> On 07/06/17 01:36, Sven Roederer wrote:
> >>> John,
> >>>
> >>> just checked with master build f500799 as initrd-kernel. Looks fine as
> >>> I can see from bootlog.
> >>> Anything special to test?
> >>>
> >>> [    1.710000] MediaTek Nand driver init, version v2.1 Fix AHB virt2phys error
> >>> [    1.720000] Allocate 16 byte aligned buffer: 80592f90
> >>> [    1.730000] Enable NFI Clock
> >>> [    1.740000] # MTK NAND # : Use HW ECC
> >>> [    1.740000] Device found in MTK table, ID: 1da, EXT_ID: 909546
> >>> [    1.760000] Support this Device in MTK table! 1da
> >>> [    1.770000] nand: device found, Manufacturer ID: 0x01, Chip ID: 0xda
> >>> [    1.780000] nand: AMD/Spansion NAND 256MiB 3,3V 8-bit
> >>> [    1.790000] nand: 256 MiB, SLC, erase size: 128 KiB, page size: 2048, OOB size: 128
> >>> [    1.800000] [NAND]select ecc bit:12, sparesize :112 spare_per_sector=28
> >>> [    1.820000] Scanning device for bad blocks
> >>> [    2.520000] 6 ofpart partitions found on MTD device MT7621-NAND
> >>> [    2.530000] Creating 6 MTD partitions on "MT7621-NAND":
> >>> [    2.540000] 0x000000000000-0x000000080000 : "u-boot"
> >>> [    2.550000] 0x000000080000-0x0000000e0000 : "u-boot-env"
> >>> [    2.560000] 0x0000000e0000-0x000000140000 : "factory"
> >>> [    2.570000] 0x000000140000-0x000000440000 : "kernel1"
> >>> [    2.580000] 0x000000440000-0x000000740000 : "kernel2"
> >>> [    2.590000] 0x000000740000-0x00000ff00000 : "ubi"
> >>> [    2.610000] [mtk_nand] probe successfully!
> >>> [    2.610000] Signature matched and data read!
> >>> [    2.620000] load_fact_bbt success 2047
> >>>
> >>>
> >>> Sven
> >> Thanks for testing, i just wanted a 2nd test result for the nand support
> >> on v4.9. i think i'll just switch ramips to v4.9 and hope all goes well.
> >>
> >>       John
> >>
> >> _______________________________________________
> >> Lede-dev mailing list
> >> Lede-dev at lists.infradead.org
> >> http://lists.infradead.org/mailman/listinfo/lede-dev
> >>
> > _______________________________________________
> > Lede-dev mailing list
> > Lede-dev at lists.infradead.org
> > http://lists.infradead.org/mailman/listinfo/lede-dev
> 
> 
> _______________________________________________
> Lede-dev mailing list
> Lede-dev at lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/lede-dev
> 



More information about the Lede-dev mailing list