On Mon, 2015-18-05 at 15:18:04 UTC, Vipin K Parashar wrote:
> This patch adds support for FSP EPOW (Early Power Off Warning) and

Please spell out the acronyms the first time you use them, including FSP.

> DPO (Delayed Power Off) events for PowerNV platform. EPOW events are
                                    ^
                                    the

> generated by SPCN/FSP due to various critical system conditions that

SPCN?

> need system shutdown. Few examples of these conditions are high
                        ^
s/need/require/ ?       A few

> ambient temperature or system running on UPS power with low UPS battery.
> DPO event is generated in response to admin initiated system request.

Blank line between paragraphs please.

>       Upon receipt of EPOW and DPO events host kernel invokes
                                            ^
                                            the host kernel

> orderly_poweroff for performing graceful system shutdown. System admin

I like it if you spell functions with a trailing () to make it clear they are
functions, so this would be "orderly_powerof()".

> can also add systemd service shutdown scripts to perform any specific
> actions like graceful guest shutdown upon system poweroff. libvirt-guests
> is systemd service available on recent distros for management of guests
> at system start/shutdown time.

This last part about the scripts is not relevant to the kernel patch so just
leave it out please.

> 
> Signed-off-by: Vipin K Parashar <vi...@linux.vnet.ibm.com>
> Reviewed-by: Joel Stanley <j...@jms.id.au>
> Reviewed-by: Vaibhav Jain <vaib...@linux.vnet.ibm.com>
> ---
>  arch/powerpc/include/asm/opal-api.h            |  44 ++++++++
>  arch/powerpc/include/asm/opal.h                |   3 +-
>  arch/powerpc/platforms/powernv/opal-power.c    | 147 
> ++++++++++++++++++++++---
>  arch/powerpc/platforms/powernv/opal-wrappers.S |   1 +
>  4 files changed, 179 insertions(+), 16 deletions(-)
> 
> diff --git a/arch/powerpc/include/asm/opal-api.h 
> b/arch/powerpc/include/asm/opal-api.h
> index 0321a90..90fa364 100644
> --- a/arch/powerpc/include/asm/opal-api.h
> +++ b/arch/powerpc/include/asm/opal-api.h
> @@ -355,6 +355,10 @@ enum opal_msg_type {
>       OPAL_MSG_TYPE_MAX,
>  };
>  
> +/* OPAL_MSG_SHUTDOWN parameter values */
> +#define      SOFT_OFF        0x00
> +#define      SOFT_REBOOT     0x01

I don't see this in the skiboot version of opal-api.h ?

They should be kept in sync.

If it's a Linux only define it should go in opal.h

>  struct opal_msg {
>       __be32 msg_type;
>       __be32 reserved;
> @@ -730,6 +734,46 @@ struct opal_i2c_request {
>       __be64 buffer_ra;               /* Buffer real address */
>  };
>  
> +/*
> + * EPOW status sharing (OPAL and the host)
> + *
> + * The host will pass on OPAL, a buffer of length OPAL_SYSEPOW_MAX
> + * with individual elements being 16 bits wide to fetch the system
> + * wide EPOW status. Each element in the buffer will contain the
> + * EPOW status in it's bit representation for a particular EPOW sub
> + * class as defiend here. So multiple detailed EPOW status bits
> + * specific for any sub class can be represented in a single buffer
> + * element as it's bit representation.
> + */
> +
> +/* System EPOW type */
> +enum OpalSysEpow {
> +     OPAL_SYSEPOW_POWER      = 0,    /* Power EPOW */
> +     OPAL_SYSEPOW_TEMP       = 1,    /* Temperature EPOW */
> +     OPAL_SYSEPOW_COOLING    = 2,    /* Cooling EPOW */
> +     OPAL_SYSEPOW_MAX        = 3,    /* Max EPOW categories */
> +};
> +
> +/* Power EPOW */
> +enum OpalSysPower {
> +     OPAL_SYSPOWER_UPS       = 0x0001, /* System on UPS power */
> +     OPAL_SYSPOWER_CHNG      = 0x0002, /* System power config change */
> +     OPAL_SYSPOWER_FAIL      = 0x0004, /* System impending power failure */
> +     OPAL_SYSPOWER_INCL      = 0x0008, /* System incomplete power */
> +};
> +
> +/* Temperature EPOW */
> +enum OpalSysTemp {
> +     OPAL_SYSTEMP_AMB        = 0x0001, /* System over ambient temperature */
> +     OPAL_SYSTEMP_INT        = 0x0002, /* System over internal temperature */
> +     OPAL_SYSTEMP_HMD        = 0x0004, /* System over ambient humidity */
> +};
> +
> +/* Cooling EPOW */
> +enum OpalSysCooling {
> +     OPAL_SYSCOOL_INSF       = 0x0001, /* System insufficient cooling */
> +};

I don't see the last three of these enums used at all, so please drop them.

>  #endif /* __ASSEMBLY__ */
>  
>  #endif /* __OPAL_API_H */
> diff --git a/arch/powerpc/include/asm/opal.h b/arch/powerpc/include/asm/opal.h
> index 042af1a..d30766f 100644
> --- a/arch/powerpc/include/asm/opal.h
> +++ b/arch/powerpc/include/asm/opal.h
> @@ -141,7 +141,8 @@ int64_t opal_pci_fence_phb(uint64_t phb_id);
>  int64_t opal_pci_reinit(uint64_t phb_id, uint64_t reinit_scope, uint64_t 
> data);
>  int64_t opal_pci_mask_pe_error(uint64_t phb_id, uint16_t pe_number, uint8_t 
> error_type, uint8_t mask_action);
>  int64_t opal_set_slot_led_status(uint64_t phb_id, uint64_t slot_id, uint8_t 
> led_type, uint8_t led_action);
> -int64_t opal_get_epow_status(__be64 *status);
> +int64_t opal_get_epow_status(uint16_t *status, uint16_t *length);

Has the signature of this function really changed or was it just wrong before?

If it's changed how do we know we're running on a version of OPAL that supports
the two argument version?

The parameter names don't seem very clear either. status is actually a pointer
to an array of "statuses", and length is the number of entries in that array.

Also you removed the endian annotations but then you pass it __be16 values, so
that looks incorrect, you should be using __be16 here.

> +int64_t opal_get_dpo_status(int64_t *dpo_timeout);

Similarly this should be __be64 AFAICS.

> diff --git a/arch/powerpc/platforms/powernv/opal-power.c 
> b/arch/powerpc/platforms/powernv/opal-power.c
> index ac46c2c..581bbd8 100644
> --- a/arch/powerpc/platforms/powernv/opal-power.c
> +++ b/arch/powerpc/platforms/powernv/opal-power.c
> @@ -1,5 +1,5 @@
>  /*
> - * PowerNV OPAL power control for graceful shutdown handling
> + * PowerNV support for OPAL power-control, poweroff events
>   *
>   * Copyright 2015 IBM Corp.
>   *
> @@ -9,18 +9,87 @@
>   * 2 of the License, or (at your option) any later version.
>   */
>  
> +#define pr_fmt(fmt)  "OPAL-POWER: "  fmt

Please don't shout, "opal-power" is fine.

>  #include <linux/kernel.h>
> +#include <linux/spinlock.h>
> +#include <linux/timer.h>

Don't think you need those?

>  #include <linux/reboot.h>
> -#include <linux/notifier.h>
> -

I think you DO need notifier.h.

> +#include <linux/of.h>
>  #include <asm/opal.h>
>  #include <asm/machdep.h>
>  
> -#define SOFT_OFF 0x00
> -#define SOFT_REBOOT 0x01
> +/* Get EPOW status */
> +static bool get_epow_status(void)

This is not a great name, "get" implies it gives you something back, but it
doesn't it just tells you true or false.

So maybe epow_event_pending() ?

> +{
> +     int i;
> +     u16 num_classes;
> +     __be16 epow_classes;

I think this would be cleaner if you just had a single num_classes and you
endian swap in the one place you use it.

> +     __be16 opal_epow_status[OPAL_SYSEPOW_MAX] = {0};
> +
> +     /* Send kernel EPOW classes supported info to OPAL */
> +     epow_classes = cpu_to_be16(OPAL_SYSEPOW_MAX);
> +
> +     /* Get EPOW events information from OPAL */
> +     opal_get_epow_status(opal_epow_status, &epow_classes);

This could fail.

> +
> +     /* Look for EPOW events present */
> +     num_classes = be16_to_cpu(epow_classes);
> +     for (i = 0; i < num_classes; i++) {
> +             if (be16_to_cpu(opal_epow_status[i]))
> +                     return true;
> +     }
> +
> +     return false;
> +}
> +
> +/* Process existing EPOW, DPO events */
> +static void process_existing_poweroff_events(void)
> +{
> +     int rc;
> +     __be64 opal_dpo_timeout;
>  
> +     /* Check for DPO event */
> +     rc = opal_get_dpo_status(&opal_dpo_timeout);
> +     if (rc != OPAL_WRONG_STATE) {
> +             pr_info("Existing DPO event detected. Powering off system\n");
> +             goto poweroff;
> +     }
> +
> +     /* Check for EPOW event */
> +     if (get_epow_status()) {
> +             pr_info("Existing EPOW event detected. Powering off system");
> +             goto poweroff;
> +     }
> +     return;
> +
> +poweroff:
> +     orderly_poweroff(true);

I don't like that much, you shouldn't need to use goto for such simple logic.

Can you create a single function, maybe called event_pending(), and have it
check both EPOW and DPO and return a bool if there's any kind of event pending.

Then this can just become:

  if (event_pending())
        orderly_poweroff(true);

> +}
> +
> +/* OPAL EPOW, DPO event notifier */
> +static int opal_epow_dpo_event(struct notifier_block *nb,
> +                             unsigned long msg_type, void *msg)
> +{
> +     switch (msg_type) {
> +     case OPAL_MSG_EPOW:
> +             pr_info("EPOW msg received. Powering off system\n");
> +             break;
> +     case OPAL_MSG_DPO:
> +             pr_info("DPO msg received. Powering off system\n");
> +             break;
> +     default:
> +             pr_err("Unknown message type %lu\n", msg_type);
> +             return 0;
> +     }
> +
> +     orderly_poweroff(true);
> +     return 0;
> +}

Why do we need a separate notifier function? Can't this just be folded into
opal_power_control_event() ?

> +
> +/* OPAL power-control events notifier */
>  static int opal_power_control_event(struct notifier_block *nb,
> -                                 unsigned long msg_type, void *msg)
> +                                     unsigned long msg_type, void *msg)
>  {
>       struct opal_msg *power_msg = msg;
>       uint64_t type;
> @@ -29,20 +98,35 @@ static int opal_power_control_event(struct notifier_block 
> *nb,
>  
>       switch (type) {
>       case SOFT_REBOOT:
> -             pr_info("OPAL: reboot requested\n");
> +             pr_info("Reboot requested\n");
>               orderly_reboot();
>               break;
>       case SOFT_OFF:
> -             pr_info("OPAL: poweroff requested\n");
> +             pr_info("Poweroff requested\n");
>               orderly_poweroff(true);
>               break;
>       default:
> -             pr_err("OPAL: power control type unexpected %016llx\n", type);
> +             pr_err("Unknown power-control type %llu\n", type);
>       }
>  
>       return 0;
>  }
>  
> +/* OPAL EPOW event notifier block */
> +static struct notifier_block opal_epow_nb = {
> +     .notifier_call  = opal_epow_dpo_event,
> +     .next           = NULL,
> +     .priority       = 0,
> +};
> +
> +/* OPAL DPO event notifier block */
> +static struct notifier_block opal_dpo_nb = {
> +     .notifier_call  = opal_epow_dpo_event,
> +     .next           = NULL,
> +     .priority       = 0,
> +};
> +
> +/* OPAL power-control event notifier block */
>  static struct notifier_block opal_power_control_nb = {
>       .notifier_call  = opal_power_control_event,
>       .next           = NULL,
> @@ -51,16 +135,49 @@ static struct notifier_block opal_power_control_nb = {
>  
>  static int __init opal_power_control_init(void)
>  {
> -     int ret;
> +     int ret, epow_dpo_supported = 0;

Can you make that a bool and call it "supported".

> +     struct device_node *node_epow;

It's typical to just call it "np".

>  
> +     /* Register OPAL power-control events notifier */
>       ret = opal_message_notifier_register(OPAL_MSG_SHUTDOWN,
> -                                          &opal_power_control_nb);
> -     if (ret) {
> -             pr_err("%s: Can't register OPAL event notifier (%d)\n",
> -                             __func__, ret);
> -             return ret;
> +                             &opal_power_control_nb);
> +     if (ret)
> +             pr_err("Power-control events notifier registration "
> +                             "failed, ret = %d\n", ret);

Please don't split the string, and similarly below.

> +
> +     /* Determine EPOW, DPO support in hardware. */
> +     node_epow = of_find_node_by_path("/ibm,opal/epow");
> +     if (node_epow) {
> +             epow_dpo_supported = of_device_is_compatible(node_epow,
> +                             "ibm,opal-v3-epow");
> +             of_node_put(node_epow);
>       }
>  
> +     if (epow_dpo_supported)
> +             pr_info("OPAL EPOW, DPO support detected.\n");
> +     else
> +             return 0;

Clearer as:

        if (!supported)
                return 0;

        pr_info("OPAL EPOW, DPO support detected.\n");

> +
> +     /* Register EPOW event notifier */
> +     ret = opal_message_notifier_register(OPAL_MSG_EPOW,
> +                     &opal_epow_nb);
> +     if (ret)
> +             pr_err("EPOW event notifier registration failed, "
> +                             "ret = %d\n", ret);
> +
> +     /* Register DPO event notifier */
> +     ret = opal_message_notifier_register(OPAL_MSG_DPO,
> +                     &opal_dpo_nb);
> +     if (ret)
> +             pr_err("DPO event notifier registration failed, "
> +                             "ret = %d\n", ret);
> +
> +     /* Check for any existing EPOW or DPO events. */
> +     process_existing_poweroff_events();
> +
> +     pr_info("Poweroff events support initialized\n");
> +
>       return 0;
>  }
> +

No extra blank line thanks.

>  machine_subsys_initcall(powernv, opal_power_control_init);


cheers
_______________________________________________
Linuxppc-dev mailing list
Linuxppc-dev@lists.ozlabs.org
https://lists.ozlabs.org/listinfo/linuxppc-dev

Reply via email to