Guys,

I want to pay your attention that we need to not only fix snapshots generation issue, but also prevent caused by it unexpected services failures (see details in a duplicate [0] of original [1] bug), which would become a challenge for not experienced users (for example he/she won't be able to authenticate in GUI or CLI some time after snapshot generation is started). I get this issue on our bare-metal lab (10 slaves) each time I have an environment which is running more than 2 days.

Links usage for files copying doesn't help in such case, because tarball is still saved on /var partition. Also, if I want to workaround this issue, I have to perform a lot of actions: s hrink 'os-varlog' volume (because it's the biggest [2] one) in order to increase unallocated disk space, resize its FS, create new volume, create FS, mount it to /var/www/nailgun/dump and update fstab. Not easy way to make "Generate Diagnostic Snapshot " button work, right?

So, if we are going to address this diagnostic snapshot issue in 8.0, I want to remind you about b) option :)

b) Make the snapshot location share the diskspace of /var/log?

Thanks!

[0] https://bugs.launchpad.net/fuel/+bug/1530131
[1] https://bugs.launchpad.net/fuel/+bug/1529182
[2] http://paste.openstack.org/show/484895/

On 18.01.16 13:20, Maciej Kwiek wrote:
Igor: It seems that fqdn -> ipaddr will indeed be resolved. Please share your feedback in review: https://review.openstack.org/#/c/266964/3

On Fri, Jan 15, 2016 at 4:25 PM, Igor Kalnitsky <ikalnit...@mirantis.com <mailto:ikalnit...@mirantis.com>> wrote:

    Sheena -

    What do you mean by *targeted*? Shotgun's designed to be a *targeted*
    solution. If someone wants more *precise* targets - it's easy to
    specify them in Nailgun's settings.yaml.

    - Igor

    On Fri, Jan 15, 2016 at 5:02 PM, Sheena Gregson
    <sgreg...@mirantis.com <mailto:sgreg...@mirantis.com>> wrote:
    > I’ve also seen the request multiple times to be able to provide more
    > targeted snapshots which might also (partially) solve this
    problem as it
    > would require significantly less disk space to grab logs from a
    subset of
    > nodes for a specific window of time, instead of the more robust
    grab-all
    > solution we have now.
    >
    >
    >
    > From: Maciej Kwiek [mailto:mkw...@mirantis.com
    <mailto:mkw...@mirantis.com>]
    > Sent: Thursday, January 14, 2016 5:59 AM
    > To: OpenStack Development Mailing List (not for usage questions)
    > <openstack-dev@lists.openstack.org
    <mailto:openstack-dev@lists.openstack.org>>
    > Subject: Re: [openstack-dev] [Fuel] Diagnostic snapshot
    generation is broken
    > due to lack of disk space
    >
    >
    >
    > Igor,
    >
    >
    >
    > I will investigate this, thanks!
    >
    >
    >
    > Artem,
    >
    >
    >
    > I guess that if we have an untrusted user on master node, he
    could just put
    > something he wants to be in the snapshot in /var/log without
    having to time
    > the attack carefully with tar execution.
    >
    >
    >
    > I want to use links for directories, this saves me the trouble
    of creating
    > hardlinks for every single file in the directory. Although with how
    > exclusion is currently implemented it can cause deleting log
    files from
    > original directories, need to check this out.
    >
    >
    >
    > About your PS: whole /var/log on master node (not in container)
    is currently
    > downloaded, I think we shouldn't change this as we plan to drop
    containers
    > in 9.0.
    >
    >
    >
    > Cheers,
    >
    > Maciej
    >
    >
    >
    > On Thu, Jan 14, 2016 at 12:32 PM, Artem Panchenko
    <apanche...@mirantis.com <mailto:apanche...@mirantis.com>>
    > wrote:
    >
    > Hi,
    >
    > using symlinks is a bit dangerous, here is a quote from the man you
    > mentioned [0]:
    >
    >> The `--dereference' option is unsafe if an untrusted user can
    modify
    >> directories while tar is running.
    >
    > Hard links usage is much safer, because you can't use them for
    directories.
    > But at the same time implementation in shotgun would be more
    complicated
    > than with symlinks.
    >
    > Anyway, in order to determine what linking to use we need to
    decide where
    > (/var/log or another partition) diagnostic snapshot will be stored.
    >
    > p.s.
    >
    >>This doesn't really give us much right now, because most of the
    logs are
    >> fetched from master node via ssh due to shotgun being run in
    mcollective
    >> container
    >
    >
    >
    > AFAIK '/var/log/docker-logs/' is available from mcollective
    container and
    > mounted to /var/log/:
    >
    > [root@fuel-lab-cz5557 ~]# dockerctl shell mcollective mount -l |
    grep
    > os-varlog
    > /dev/mapper/os-varlog on /var/log type ext4
    > (rw,relatime,stripe=128,data=ordered)
    >
    > From my experience '/var/log/docker-logs/remote' folder is most
    ' heavy'
    > thing in snapshot.
    >
    > [0]
    http://www.gnu.org/software/tar/manual/html_node/dereference.html
    >
    > Thanks!
    >
    >
    >
    > On 14.01.16 13:00, Igor Kalnitsky wrote:
    >
    > I took a glance on Maciej's patch and it adds a switch to tar
    command
    >
    > to make it follow symbolic links
    >
    > Yeah, that should work. Except one thing - we previously had fqdn ->
    >
    > ipaddr links in snapshots. So now they will be resolved into full
    >
    > copy?
    >
    >
    >
    > I meant that symlinks also give us the benefit of not using
    additional
    >
    > space (just as hardlinks do) while being able to link to files from
    >
    > different filesystems.
    >
    > I'm sorry, I got you wrong. :)
    >
    >
    >
    > - Igor
    >
    >
    >
    > On Thu, Jan 14, 2016 at 12:34 PM, Maciej Kwiek
    <mkw...@mirantis.com <mailto:mkw...@mirantis.com>> wrote:
    >
    > Igor,
    >
    >
    >
    > I meant that symlinks also give us the benefit of not using
    additional space
    >
    > (just as hardlinks do) while being able to link to files from
    different
    >
    > filesystems.
    >
    >
    >
    > Also, as Barłomiej pointed out the `h` switch for tar should do
    the trick
    >
    > [1].
    >
    >
    >
    > Cheers,
    >
    > Maciej
    >
    >
    >
    > [1]
    http://www.gnu.org/software/tar/manual/html_node/dereference.html
    >
    >
    >
    > On Thu, Jan 14, 2016 at 11:22 AM, Bartlomiej Piotrowski
    >
    > <bpiotrow...@mirantis.com <mailto:bpiotrow...@mirantis.com>> wrote:
    >
    > Igor,
    >
    >
    >
    > I took a glance on Maciej's patch and it adds a switch to tar
    command to
    >
    > make it follow symbolic links, so it looks good to me.
    >
    >
    >
    > Bartłomiej
    >
    >
    >
    > On Thu, Jan 14, 2016 at 10:39 AM, Igor Kalnitsky
    <ikalnit...@mirantis.com <mailto:ikalnit...@mirantis.com>>
    >
    > wrote:
    >
    > Hey Maceij -
    >
    >
    >
    > About hardlinks - wouldn't it be better to use symlinks?
    >
    > This way we don't occupy more space than necessary
    >
    > AFAIK, hardlinks won't occupy much space. They are the links,
    after all.
    >
    > :)
    >
    >
    >
    > As for symlinks, I'm afraid shotgun (and fabric underneath) won't
    >
    > resolve them and links are get to snapshot As Is. That means if
    there
    >
    > will be no content in the snapshot they are pointing to, they are
    >
    > simply useless. Needs to be checked, though.
    >
    >
    >
    > - Igor
    >
    >
    >
    > On Thu, Jan 14, 2016 at 10:31 AM, Maciej Kwiek
    <mkw...@mirantis.com <mailto:mkw...@mirantis.com>>
    >
    > wrote:
    >
    > Thanks for your insight guys!
    >
    >
    >
    > I agree with Oleg, I will see what I can do to make this work
    this way.
    >
    >
    >
    > About hardlinks - wouldn't it be better to use symlinks? This way we
    >
    > don't
    >
    > occupy more space than necessary, and we can link to files and
    >
    > directories
    >
    > that are in other block device than /var. Please see [1] review
    for a
    >
    > proposed change that introduces symlinks.
    >
    >
    >
    > This doesn't really give us much right now, because most of the logs
    >
    > are
    >
    > fetched from master node via ssh due to shotgun being run in
    >
    > mcollective
    >
    > container, but it's something! When we remove containers, this will
    >
    > prove
    >
    > more useful.
    >
    >
    >
    > Regards,
    >
    > Maciej Kwiek
    >
    >
    >
    > [1] https://review.openstack.org/#/c/266964/
    >
    >
    >
    > On Tue, Jan 12, 2016 at 1:51 PM, Oleg Gelbukh
    <ogelb...@mirantis.com <mailto:ogelb...@mirantis.com>>
    >
    > wrote:
    >
    > I think we need to find a way to:
    >
    >
    >
    > 1) verify the size of snapshot without actually making it and
    compare
    >
    > to
    >
    > the available disk space beforehand.
    >
    > 2) refuse to create snapshot if space is insufficient and notify
    user
    >
    > (otherwise it breaks Admin node as we have seen)
    >
    > 3) provide a way to prioritize elements of the snapshot and exclude
    >
    > them
    >
    > based on the priorities or user choice.
    >
    >
    >
    > This will allow for better and safer UX with the snapshot.
    >
    >
    >
    > --
    >
    > Best regards,
    >
    > Oleg Gelbukh
    >
    >
    >
    > On Tue, Jan 12, 2016 at 1:47 PM, Maciej Kwiek
    <mkw...@mirantis.com <mailto:mkw...@mirantis.com>>
    >
    > wrote:
    >
    > Hi!
    >
    >
    >
    > I need some advice on how to tackle this issue. There is a bug [1]
    >
    > describing the problem with creating a diagnostic snapshot. The
    issue
    >
    > is
    >
    > that /var/log has 100GB available, while /var (where diagnostic
    >
    > snapshot is
    >
    > being generated - /var/www/nailgun/dump/fuel-snapshot according to
    >
    > [2]) has
    >
    > 10GB available, so dumping the logs can be an issue when logs size
    >
    > exceed
    >
    > free space in /var.
    >
    >
    >
    > There are several things we could do, but I am unsure on which
    course
    >
    > to
    >
    > take. Should we
    >
    > a) Allocate more disk space for /var/www (or for whole /var)?
    >
    > b) Make the snapshot location share the diskspace of /var/log?
    >
    > c) Something else? What?
    >
    >
    >
    > Please share your thoughts on this.
    >
    >
    >
    > Cheers,
    >
    > Maciej Kwiek
    >
    >
    >
    > [1] https://bugs.launchpad.net/fuel/+bug/1529182
    >
    > [2]
    >
    >
    >
    >
    
https://github.com/openstack/fuel-web/blob/2855a9ba925c146b4802ab3cd2185f1dce2d8a6a/nailgun/nailgun/settings.yaml#L717
    >
    >
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    >
    > openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    >
    > openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    >
    > openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    >
    > openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    __________________________________________________________________________
    >
    > OpenStack Development Mailing List (not for usage questions)
    >
    > Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    >
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    > --
    >
    > Artem Panchenko
    >
    > QA Engineer
    >
    >
    >
    __________________________________________________________________________
    > OpenStack Development Mailing List (not for usage questions)
    > Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >
    >
    >
    >
    >
    __________________________________________________________________________
    > OpenStack Development Mailing List (not for usage questions)
    > Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
    >

    __________________________________________________________________________
    OpenStack Development Mailing List (not for usage questions)
    Unsubscribe:
    openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
    <http://openstack-dev-requ...@lists.openstack.org?subject:unsubscribe>
    http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev




__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

--
Artem Panchenko
QA Engineer

__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Reply via email to