On Thu, 2010-01-14 at 14:41 -0600, Joel Schopp wrote:
> In the previous patch the client-architecture field for the number of
> cores supported is set statically as high as is possible.  However, that
> static setting could be too high if the system supports smt, resulting
> in cpus assigned to Linux that are not booted.  This patch reads the
> device tree (before it is unflattened) to determine the amount of smt.
> It then dynamically updates the entires in the array with the proper
> number of cores supported.  Tests show this correctly detecting SMT4 on
> a Power7 and still booting all the supported cores on a large machine.
Same comments about submission format as the previous patch...

> Signed-off-by:Joel Schopp<jsch...@austin.ibm.com> 
> Index: linux-2.6.git/arch/powerpc/kernel/prom_init.c
> ===================================================================
> --- linux-2.6.git.orig/arch/powerpc/kernel/prom_init.c
> +++ linux-2.6.git/arch/powerpc/kernel/prom_init.c
> @@ -141,6 +141,8 @@ typedef u32 cell_t;
>  
>  extern void __start(unsigned long r3, unsigned long r4, unsigned long r5);
>  
> +static int __init prom_smt_way(void);

Do you really need a forward declaration ? We generally avoid those and
prefer instead having the functions in the right order to make it
unnecessary.

>  #ifdef CONFIG_PPC64
>  extern int enter_prom(struct prom_args *args, unsigned long entry);
>  #else
> @@ -811,9 +813,17 @@ static void __init prom_send_capabilitie
>  {
>       ihandle elfloader, root;
>       prom_arg_t ret;
> +     u32 *cores;
>  
>       root = call_prom("open", 1, 1, ADDR("/"));
>       if (root != 0) {
> +             /*
> +              * If you add to the struct, please be sure the 100 index
> +              * didn't change.  The BUILD_BUG_ON is a reminder.
> +              */
> +             BUILD_BUG_ON(sizeof(ibm_architecture_vec) != 108);

This is indeed a bit fishy... a nicer way may have been to have the
vector in an asm file with labels but that's probably overkill. Just
maybe add a runtime test that checks the value read initially is
NR_CORES and if not, print a big fat warning ?

> +             cores = (u32 *) &ibm_architecture_vec[100];
> +             *cores = (u32) (NR_CPUS/prom_smt_way());

The style is a bit gross though I suppose it will do. Drop the cast on
the second line, it's not useful, and stick some spaces around that
division. Also maybe print out a message saying to what value you
adjusted the max number of supported cores, might be useful on the field
to diagnose issues.

>               /* try calling the ibm,client-architecture-support method */
>               prom_printf("Calling ibm,client-architecture-support...");
>               if (call_prom_ret("call-method", 3, 2, &ret,
> @@ -1031,6 +1041,45 @@ static void __init reserve_mem(u64 base,
>       RELOC(mem_reserve_cnt) = cnt + 1;
>  }
>  
> +
> +static int __init prom_smt_way(void)
> +{
> +     phandle node;
> +     char type[64];
> +     unsigned int plen;
> +
> +     for (node = 0; prom_next_node(&node); ) {
> +             type[0] = 0;
> +             prom_getprop(node, "device_type", type, sizeof(type));
> +
> +             if (type[0] == 0) {
> +                     /*
> +                      * CHRP Longtrail machines have no device_type
> +                      * on the memory node, so check the name instead...
> +                      */
> +                     prom_getprop(node, "name", type, sizeof(type));

They also have no SMT :-) Just continue instead. You could also have
instead gone for /cpus and peeked at the first child. Might have lead to
a simpler construct (and faster runtime)

> +             }
> +             if (strcmp(type, RELOC("cpu")))
> +                     continue;
> +
> +             /*
> +              * There is an entry for each smt thread, each entry being
> +              * 4 bytes long.  All cpus should have the same number of
> +              * smt threads, so return after finding the first.
> +              */
> +             plen = prom_getproplen(node, "ibm,ppc-interrupt-server#s");
> +             prom_debug("smt %x\n", (unsigned long) plen);

Might only be a debug message but it should be a tad more verbose about
what it's actually printing.

> +             if (plen >= 4)
> +                     return plen / 4;
> +     }
> +     /*
> +      * If things go wrong and we get here fallback to SMT1
> +      */
> +     prom_debug("unable to determine smt from device tree, guessing smt1\n");
> +     return 1;
> +
> +}
> +

Cheers,
Ben.

>  /*
>   * Initialize memory allocation mechanism, parse "memory" nodes and
>   * obtain that way the top of memory and RMO to setup out local allocator
> 
> 
> _______________________________________________
> Linuxppc-dev mailing list
> Linuxppc-dev@lists.ozlabs.org
> https://lists.ozlabs.org/listinfo/linuxppc-dev


_______________________________________________
Linuxppc-dev mailing list
Linuxppc-dev@lists.ozlabs.org
https://lists.ozlabs.org/listinfo/linuxppc-dev

Reply via email to