Upgrades

ENTERPRISE

BETA

Instructions for upgrading the DC/OS Storage Service on a DC/OS Enterprise cluster.

WARNING: Beta Software This software may contain bugs, incomplete features, incorrect documentation, or other discrepancies. Contact Mesosphere before deploying a beta candidate service. Product support is available to approved participants in the beta test program. Contact support@mesosphere.io for information about participation.

Manually upgrade the DSS package to 0.5.x from 0.4.x

DSS 0.5 requires DC/OS Enterprise 1.12.1 or higher. Therefore, to upgrade from DSS 0.4 to DSS 0.5, you will also need to upgrade DC/OS Enterprise. Please follow these steps to upgrade:

  1. Remove DSS created volumes and volume providers from all agents in the cluster.
  2. Uninstall DSS.
  3. Remove volume, volume providers, and related assets from Zookeeper.
  4. Check agents for lingering CSI plugins and/or DSS-created LVM volumes/groups. Then, for every agent, do the following:
    1. Stop the agent.
    2. Kill all CSI plugin processes (such as, csilvm and devices-plugin).
      $ pkill csilvm && pkill devices-plugin
      
    3. Remove files under the Mesos resource provider configuration directory.
      $ rm /var/lib/dcos/mesos/resource-providers/*
      
    4. Remove checkpointed CSI volume state.
      $ rm -r /var/lib/mesos/slave/csi/
      
    5. Remove the latest symlink for each Mesos resource provider.
      $ rm /var/lib/mesos/slave/meta/slaves/latest/resource_providers/org.apache.mesos.rp.local.storage/*/latest
      
    6. Remove any DSS-created LVM volume groups and/or volumes.
      $ vgremove -f some-volumegroup-name
      
  5. Upgrade DC/OS Enterprise to 1.12.1.
  6. Install DSS 0.5.
  7. Follow these steps to install device providers on desired nodes.

If DSS 0.4 is not removed before upgrading DC/OS Enterprise to 1.12.1, you might see the following fatal error on agents:

Check failed: update.status().resource_provider_id() == resourceProvider->info.id()

If you encounter this error, please downgrade all failed agents to DC/OS Enterprise 1.12.0 and follow the above steps, then upgrade the agents again.

Manually upgrade the DSS package to 0.4.0 from 0.3.0

Please follow these steps to upgrade:

  1. Remove DSS created volumes and volume providers from all agents in the cluster.
  2. Uninstall DSS.
  3. Remove volume, volume providers, and related assets from Zookeeper.
  4. Check agents for lingering CSI plugins and/or DSS-created LVM volumes/groups. Then, for every agent, do the following:
    1. Stop the agent.
    2. Kill all CSI plugin processes (such as, csilvm and devices-plugin).
      $ pkill csilvm && pkill devices-plugin
      
    3. Remove files under Mesos resource provider configuration directory.
      $ rm /opt/mesosphere/etc/mesos/resource-providers/*
      
    4. Remove checkpointed CSI volume state.
      $ rm -r /var/lib/mesos/slave/csi/
      
    5. Remove the latest symlink for the agent.
      $ rm /var/lib/mesos/slave/slaves/latest
      
    6. Remove any DSS-created LVM volume groups and/or volumes.
      $ vgremove -f some-volumegroup-name
      
  5. Install DSS 0.4.
  6. Follow these steps to install device providers on desired nodes.

Manually upgrade the DSS package to 0.3.0 from 0.2.0

DSS 0.3 requires DC/OS Enterprise 1.12 or later, but DSS 0.2 only works for DC/OS Enterprise 1.11. Therefore, to upgrade from DSS 0.2 to DSS 0.3, you will also need to upgrade DC/OS Enterprise.

Since there are some breaking changes between DSS 0.2 and DSS 0.3, please perform the following steps to upgrade:

  1. Remove DSS created volumes and volume providers from all agents in the cluster.
  2. Uninstall DSS.
  3. Remove volume, volume providers, and related assets from Zookeeper.
  4. For every agent, do the following:
    1. Stop the agent.
    2. Kill all CSI plugin processes (such as, csilvm and devices-plugin).
      $ pkill csilvm && pkill devices-plugin
      
    3. Remove files under Mesos resource provider configuration directory.
      $ rm /opt/mesosphere/etc/mesos/resource-providers/*
      
    4. Remove checkpointed CSI volume state.
      $ rm -r /var/lib/mesos/slave/csi/
      
    5. Remove the latest symlink for the agent.
      $ rm /var/lib/mesos/slave/slaves/latest
      
  5. Upgrade DC/OS Enterprise to 1.12.
  6. Install DSS 0.3.
  7. Follow these steps to install device providers on desired nodes.

Manually upgrade the DSS package to v0.2.0 from v0.1.0

  1. Uninstall the currently installed DSS package and package repository.
  2. Remove volume, volume providers, and related assets from Zookeeper.
  3. Remove DSS-created volumes and volume providers from all agents in the cluster.
  4. Install the latest release of DSS.