[BUG] Linux node cannot be found after rebooting the host machine #252

@jinrongchi

Description

Describe the bug
The Linux node fails to start after the host machine reboots (the reboot was triggered by Windows updates).
Start-AksEdgeNode reports that the network interface is missing:
Error: rpc error: code = Unknown desc = Endpoint Name Interface not found]

To Reproduce
Steps to reproduce the behavior:

  1. Deploy a Single Machine Cluster
  2. Reboot (for Windows Updates)
  3. Run Command Start-AksEdgeNode -NodeType Linux
  4. An exception is caught: the endpoint is not found

Expected behavior
The node should restart successfully.
The kubectl command should be able to connect to the cluster and check resources.

Screenshots
AksEdge cmdlet errors:

Image

Image

Image

kubectl command error:

Image

The config file that I used to deploy the cluster:

{
  "SchemaVersion": "1.9",
  "Version": "1.0",
  "DeploymentType": "SingleMachineCluster",
  "Init": {
    "ServiceIPRangeSize": 1
  },
  "Network": {
    "NetworkPlugin": "flannel",
    "Ip4AddressPrefix": "192.168.0.0/24",
    "SkipAddressFreeCheck": true,
    "DnsServers": [
      "10.140.103.250",
      "10.145.103.250"
    ],
    "InternetDisabled": true,
    "SkipDnsCheck": false,
    "Proxy": {
      "Http": null,
      "Https": null,
      "No": "localhost,127.0.0.0/8,192.168.0.0/16,172.17.0.0/16,10.42.0.0/16,10.43.0.0/16,10.96.0.0/12,10.244.0.0/16,.svc"
    },
    "Ip4PrefixLength": 24
  },
  "User": {
    "AcceptEula": true,
    "AcceptOptionalTelemetry": true,
    "VolumeLicense": {
      "EnrollmentID": null,
      "PartNumber": null
    },
    "Force": null
  },
  "Machines": [
    {
      "LinuxNode": {
        "CpuCount": 8,
        "MemoryInMB": 16384,
        "LogSizeInGB": 10,
        "DataSizeInGB": 30,
        "TimeoutSeconds": 300,
        "TpmPassthrough": false,
        "SecondaryNetworks": [
          {
            "VMSwitchName": null,
            "Ip4Address": null,
            "Ip4GatewayAddress": null,
            "Ip4PrefixLength": null
          }
        ],
        "ControlPlane": true
      },
      "NetworkConnection": {
        "AdapterName": null,
        "Mtu": 0
      }
    }
  ]
}
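
For triage, here is a minimal sketch (Python, standard library only) that sanity-checks the network-related fields of the config above. The function name `check_network_config`, the trimmed `SAMPLE` copy of the config, and the choice of checks are illustrative assumptions, not part of the AKS Edge Essentials tooling. It does not diagnose the missing endpoint; it only rules out obvious inconsistencies in the file.

```python
import ipaddress

# Trimmed copy of the deployment config from this issue (network-related fields only).
SAMPLE = {
    "Network": {
        "Ip4AddressPrefix": "192.168.0.0/24",
        "DnsServers": ["10.140.103.250", "10.145.103.250"],
        "Ip4PrefixLength": 24,
    },
    "Machines": [
        {
            "LinuxNode": {
                "SecondaryNetworks": [
                    {
                        "VMSwitchName": None,
                        "Ip4Address": None,
                        "Ip4GatewayAddress": None,
                        "Ip4PrefixLength": None,
                    }
                ]
            }
        }
    ],
}


def check_network_config(config: dict) -> list[str]:
    """Return human-readable findings for the network-related fields (hypothetical helper)."""
    findings = []
    network = config.get("Network", {})

    # Ip4AddressPrefix carries its own prefix length; compare it with Ip4PrefixLength.
    prefix = ipaddress.ip_network(network["Ip4AddressPrefix"], strict=True)
    declared = network.get("Ip4PrefixLength")
    if declared is not None and declared != prefix.prefixlen:
        findings.append(
            f"Ip4PrefixLength {declared} does not match Ip4AddressPrefix /{prefix.prefixlen}"
        )

    # Every DNS server entry should parse as an IP address.
    for server in network.get("DnsServers") or []:
        try:
            ipaddress.ip_address(server)
        except ValueError:
            findings.append(f"DnsServers entry is not a valid IP address: {server}")

    # Flag secondary network entries that are placeholders with no values set.
    for machine in config.get("Machines", []):
        linux_node = machine.get("LinuxNode") or {}
        for entry in linux_node.get("SecondaryNetworks") or []:
            if all(value is None for value in entry.values()):
                findings.append("SecondaryNetworks contains an entry with every field null")

    return findings


if __name__ == "__main__":
    for finding in check_network_config(SAMPLE):
        print(finding)
```

For the config in this issue, the prefix length and DNS entries are consistent, and the only finding is the all-null SecondaryNetworks entry. Whether that placeholder entry is related to the missing endpoint is not established here.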

Environment (please complete the following information):

  • AKS Edge Essentials Version [AksEdge-K3s-1.27.6-1.6.384.0.msi]
  • Kubernetes version [1.26.10]
  • Windows Host OS (please complete the following information):
    • Edition: [W11 Enterprise]
    • Version: [10.0.26100 Build 26100]
    • Virtual Machine: [Local]

Additional context
None
