pcp
[Top] [All Lists]

Re: [performancecopilot/pcp] pmcd causes complete system lockup on CentO

To: performancecopilot/pcp <pcp@xxxxxxxxxxxxxxxxxx>
Subject: Re: [performancecopilot/pcp] pmcd causes complete system lockup on CentOS 7 on VMware (#107)
From: Jeff White <notifications@xxxxxxxxxx>
Date: Tue, 16 Aug 2016 11:46:28 -0700
Delivered-to: pcp@xxxxxxxxxxx
Dkim-signature: v=1; a=rsa-sha1; c=relaxed; d=github.com; h=from:reply-to:to:in-reply-to:references:subject:mime-version:content-type:content-transfer-encoding:list-id:list-archive:list-post:list-unsubscribe; s=s20150108; bh=om6KK/Z4d2nfUg4l2rW56OLUggY=; b=lOSc059sUtooZN0k m0mrm5lz4ZPmJr1V3/rp0Rli6rj8qXVvIo4nTNA+ZgM+o/QClVRAex1x8bUtG2YC tdoThe/p9h7TKMKP65UK25sruhhfDuq1DPipieH8z4b9yNyK43Czly3zwYD2TqVp be69JLwxNYg0erKDaadRg/8JuWo=
In-reply-to: <performancecopilot/pcp/issues/107@xxxxxxxxxx>
List-archive: https://github.com/performancecopilot/pcp
List-id: performancecopilot/pcp <pcp.performancecopilot.github.com>
List-post: <mailto:reply+00bd08b63c751db6908ca4d8a44dd0dd061e0905123ff01592cf0000000113cb218492a169ce0a350886@reply.github.com>
List-unsubscribe: <mailto:unsub+00bd08b63c751db6908ca4d8a44dd0dd061e0905123ff01592cf0000000113cb218492a169ce0a350886@reply.github.com>, <https://github.com/notifications/unsubscribe/AL0ItvhLdsfMkDd0KlH-ChcrqWXa08UNks5qggWEgaJpZM4JktgK>
References: <performancecopilot/pcp/issues/107@xxxxxxxxxx>
Reply-to: performancecopilot/pcp <reply+00bd08b63c751db6908ca4d8a44dd0dd061e0905123ff01592cf0000000113cb218492a169ce0a350886@xxxxxxxxxxxxxxxx>

@natoscott, hardware issue is unlikely as we're talking about VMs hanging when pmcd runs and numerous physical machines exhibiting off behavior like hanging on shutdown. I agree on the kernel being a likely source of the problem as this is indeed a kernel crash. pmcd is somehow triggering it. After much fighting I have been able to get kdump working and I have crash dumps of the problem. One of them is here:

https://drive.google.com/open?id=0B6emnuNXtougZmhGSWhrS2ZJZVU

I also have this poorly made screencast demonstrating the behavior by simply starting pmcd:

https://youtu.be/Gq25aZrWodg

I'm not much of a kernel engineer so I'm of limited help at this point.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.

<Prev in Thread] Current Thread [Next in Thread>