Problem
You see the following event occuring daily at about 6:12 AM on your SharePoint 2013 farm servers:
This error is generated if the Config Refresh timer job finds out-of-sync caches among the SharePoint servers. System topology includes one application server and two web front end servers.
This posting unfortunately does not present a solution, but documents troubleshooting steps and reference articles for future reference.
Troubleshooting
You see the following event occuring daily at about 6:12 AM on your SharePoint 2013 farm servers:
Log Name: Application Source: Microsoft-SharePoint Products-SharePoint Foundation Date: [date/time] Event ID: 6398 Task Category: Timer Level: Critical Keywords: User: [DOMAIN/FarmServiceAcct] Computer: [SharePoint Server] Description: The Execute method of job definition Microsoft.SharePoint.Diagnostics. SPDiagnosticsMetricsProvider (ID 9cde39fb-4971-4a03-9612-0978098691d7) threw an exception. More information is included below. An update conflict has occurred, and you must re-try this action. The object SPWebService was updated by [DOMAIN/FarmServiceAcct], in the OWSTIMER (8696) process, on machine [SharePoint Server]. View the tracing log for more information about the conflict. Event Xml: ...
This error is generated if the Config Refresh timer job finds out-of-sync caches among the SharePoint servers. System topology includes one application server and two web front end servers.
This posting unfortunately does not present a solution, but documents troubleshooting steps and reference articles for future reference.
Troubleshooting
- Check cache ID
- APP1: 2248964
- WFE1: 2248968
- WFE2: 2248970
Note: this value is changing constantly. if you open each cache.ini file on each machine separately, you may get different values - not because the caches are out of sync but due to the cache ID changing from when you open the file on one machine to when you open it on another machine. To get an accurate snapshot of this value at any moment in time for all machines, have remote sessions open on all machines simulataneously, and then in quick succession make copies of the file on each one. Then open these copies to determine the actual cache ID.
- Clear cache
- First, stopped Timer service on all SharePoint servers.
- Then on each server, starting with App1:
- Navigated to C:\ProgramData\Microsoft\SharePoint\Config.
- Looked for current GUID folder (check dates).
- Deleted all XML files in this folder.
- Replaced contents of cache.ini with "1".
- Started timer service on APP1
- Waited for it to rebuild fully.
Note: if you have security software (McAfee, etc) installed, it will significantly consume resources scanning the creation of all of the new cache files, temporarily adversely impacting performance.
- 1658 XML cache files generated
- Started timer service on WFE1
- Waited for it to rebuild fully.
- 1658 XML cache files generated
- Refreshed application logs APP1 and WFE1
- Started timer service on WFE2
- Waited for it to rebuild fully
- 1658 XML cache files generated
- Verify cache IDs
- After cache rebuild completed, checked contents of each cache.ini:
- APP1: 2248998
- WFE1: 2248998
- WFE2: 2248998
See note above on getting accurate values for ID.
- Verify solution
- Check event logs one day later: same issue recurring
- Check event logs five days later: same issue recurring.
- None found at this time 12/29/14.
- Error con el job SPDiagnosticsMetricsProvider
- Sharepoint 2010 – Clearing the Configuration Cache
- Error message when you try to modify or to delete an alternate access mapping in Windows SharePoint Services 3.0: "An update conflict has occurred, and you must re-try this action"
- SharePoint 2013: Event ID 6398 An update conflict has occurred, and you must re-try this action
- SharePoint 2013 Error 6398 The Execute method of job definition Microsoft.SharePoint.Diagnostics.SPDiagnosticsMetricsProvider threw an exception
- Clear SharePoint Config Cache with PowerShell
- Timer Event 6398 - SPDatabaseServerDiagnosticsPerformanceCounter
- Also checked development farm servers: APP1: 2733002, WFE1: 2733006 and WFE2: 2733006. After performing the above procedure: APP1: 2733071, WFE1: 2733071 and WFE2: 2733071. Same experience: issue continues to occur after clearing cache.
1 comment:
Until Microsoft will add timer job configuration on server level and not on farm level, similar issues will remain.
Post a Comment