Project

General

Profile

Anomalie #3603

Rédémarrage difficile des vm de coon

Added by Christian P. Momon almost 3 years ago. Updated about 2 years ago.

Status:
Fermé
Priority:
Normale
Assignee:
Christian P. Momon
Category:
-
Target version:
-
Start date:
02/19/2019
Due date:
% Done:

0%

Estimated time:

Description

Suite à un dist-upgrade de coon, le reboot a mal fonctionné : les vm n'ont pas démarré.
Un reboot supplémentaire a permis de tout faire rentrer dans l'ordre.
Le problème a été constaté lors des 2 derniers reboot de cluster.
Est-ce un problème de délai de boot ?

Premier boot (en erreur) :

cpm@ocmstar (23:32:36) ~ 7 > sshapril root@coon.chapril.org
Linux coon.chapril.org 4.9.0-8-amd64 #1 SMP Debian 4.9.144-3 (2019-02-02) x86_64

The programs included with the Debian GNU/Linux system are free software;
the exact distribution terms for each program are described in the
individual files in /usr/share/doc/*/copyright.

Debian GNU/Linux comes with ABSOLUTELY NO WARRANTY, to the extent
permitted by applicable law.
Last login: Mon Feb 18 23:28:17 2019 from 2a01:e35:2fb3:320:8b1:655c:53ff:a404
=(^-^)=root@coon:~# drbdadm primary coon
=(^-^)=root@coon:~# mount /var/lib/libvirt/coon
=(^-^)=root@coon:~# cd /etc/libvirt/qemu
=(^-^)=root@coon:/etc/libvirt/qemu# for host in $(ls *xml | sed -e 's/.xml//g'| grep -v modele) ; do virsh start $host ; done
error: Failed to start domain admin
error: Cannot access storage file '/var/lib/libvirt/maine/admin.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain bastion
error: Cannot access storage file '/var/lib/libvirt/maine/bastion.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain dns
error: Requested operation is not valid: network 'default' is not active

error: Failed to start domain lamp
error: Requested operation is not valid: network 'default' is not active

error: Failed to start domain libreoffice
error: Requested operation is not valid: network 'default' is not active

error: Failed to start domain mail
error: Requested operation is not valid: network 'default' is not active

error: Failed to start domain pad
error: Cannot access storage file '/var/lib/libvirt/maine/pad.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain pouet
error: Cannot access storage file '/var/lib/libvirt/maine/pouet.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain sympa
error: Requested operation is not valid: network 'default' is not active

=(^-^)=root@coon:/etc/libvirt/qemu# virsh list
 Id    Name                           State
----------------------------------------------------

Deuxième boot (nominal) :

cpm@ocmstar (23:39:45) ~ 10 > sshapril root@coon.chapril.org
Linux coon.chapril.org 4.9.0-8-amd64 #1 SMP Debian 4.9.144-3 (2019-02-02) x86_64

The programs included with the Debian GNU/Linux system are free software;
the exact distribution terms for each program are described in the
individual files in /usr/share/doc/*/copyright.

Debian GNU/Linux comes with ABSOLUTELY NO WARRANTY, to the extent
permitted by applicable law.
Last login: Mon Feb 18 23:32:38 2019 from 2a01:e35:2fb3:320:8b1:655c:53ff:a404
=(^-^)=root@coon:~# drbdadm primary coon
=(^-^)=root@coon:~# mount /var/lib/libvirt/coon
=(^-^)=root@coon:~# cd /etc/libvirt/qemu
=(^-^)=root@coon:/etc/libvirt/qemu# for host in $(ls *xml | sed -e 's/.xml//g'| grep -v modele) ; do virsh start $host ; done
error: Failed to start domain admin
error: Cannot access storage file '/var/lib/libvirt/maine/admin.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain bastion
error: Cannot access storage file '/var/lib/libvirt/maine/bastion.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

Domain dns started

Domain lamp started

Domain libreoffice started

Domain mail started

error: Failed to start domain pad
error: Cannot access storage file '/var/lib/libvirt/maine/pad.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

error: Failed to start domain pouet
error: Cannot access storage file '/var/lib/libvirt/maine/pouet.qcow2' (as uid:64055, gid:64055): Aucun fichier ou dossier de ce type

Domain sympa started


Related issues

Related to Infra Chapril - Anomalie #4601: Redémarrage difficile des vm coonFermé07/15/2020

Actions

History

#1

Updated by Quentin Gibeaux almost 3 years ago

Quand ça a planté, as-tu pensé à vérifier que le serveur avait fini de démarrer ? (Genre journalctl -f)

#2

Updated by Christian P. Momon almost 3 years ago

  • Status changed from Nouveau to En cours de traitement
  • Assignee set to Christian P. Momon

Je confirme n'avoir pas regardé. Je le ferai la prochaine fois.

#3

Updated by Christian P. Momon almost 3 years ago

  • Status changed from En cours de traitement to Attente d'information
#4

Updated by Quentin Gibeaux over 2 years ago

Autre astuce : systemctl status
Ça affiche

State: running
, quand c'est fini de booter.

#5

Updated by Christian P. Momon over 2 years ago

Bien vu. Jusqu'ici je vérifiais dans les logs système la présence de « Startup » :

Jun  7 02:00:09 adl systemd[1]: Startup finished in 12.319s (kernel) + 2min 5.151s (userspace) = 2min 17.470s.

Du coup, c'est encore plus facile avec le status :D

#6

Updated by Christian P. Momon over 2 years ago

  • Status changed from Attente d'information to Résolu

Bilan après 4 mois en faisant attention à attendre la fin du démarrage du système avant de faire des actions : le problème n'est plus rencontré.

En conséquence, fermeture du ticket.

#7

Updated by Christian P. Momon about 2 years ago

  • Project changed from Chapril to Infra Chapril
  • Status changed from Résolu to Fermé
#8

Updated by Christian P. Momon over 1 year ago

Also available in: Atom PDF