I have a weird issue. On a serverfarm I manage, User Profile Disks randomly stop working, causing users who login to loose access to their desktop, including the administrator user.
Server configuration
All servers are virtualized using Proxmox in a clustered setup. The following servers are present:
- DC: This server primarily functions as a domain controller and fileserver.
- S: This is an app server that functions as a terminal server, but used specifically for certain apps. This server is not utilized much.
- TS (6x): 6 different Terminal Servers that are configured identical, running the same software and having the same Group Policies applied. Office and Teams are both installed, and users have E3 licenses and single-sign-on is setup with ADSync.
- WEB: Broker server that also is being used to start an RDP connection by visiting a web page.
All virtual servers run Windows Server 2016 and are up-to-date when it comes to Windows Update.
Added security: 2FA using DUO mobile for both RDP Web and RDP Gateway sign-in.
The UserProfileDisks are stored on the DC.
The problem
Users login, the broker assigns a server based on least usage and the servers get filled with members. At a random moment, usually in the morning, A server can decide to stop working correctly. The next user that logs in does not get their userprofiledisk loaded, and gets a black screen with only a recycle bin on it and the start menu does not work. We can log out a user from the DC and if the user logs in again, if they get send back to the same problematic server, it happens again. If they get sent to another server, they get lucky and they login normally.
Currently, whenever this happens, we disable login for this TS from the broker server and schedule a reboot. A reboot always fixes the issue, but given that users are able to login before the problem starts happening, we can't just reboot the server at that moment. That said, I want to fix the issue so it does not come back.
When it happens and I log into as Administrator, I get an error message that location C:\Users\Administrator\Desktop is not available. As administrator, I can access the task manager but the start menu does not work. I can press CTRL-ALT-END to log off too.
If I go to C:\Users\ I see shortcuts (symlinks). The one for Administrator does not work anymore at that point. I can delete it manually, or it sometimes gets deleted when I log out, but once deleted, it does not come back. I also see lots of temporary profiles at this point. When things do work, the symlinks are created and I can double click them to access the folder.
I can't find anything useful in the EventLog, but then again, I don't know what I'm looking for, so I may have missed something obvious. I did check all errors though, and did not find anything that could explain what is happening.
EDIT: I have now found an eventID that may be the root cause, but I have no idea why this is happening.
EventID: 158 Disk 14 has the same disk identifiers as one or more disks connected to the system. Go to Microsoft's support website (http://support.microsoft.com) and search for KB2983588 to resolve the issue.
I've looked through all GUID's of the profile disks, but they are unique.
0 Answers