11. Fixing PRD
▸ What item is breaking PRD?
▸ Why doesn’t it let Sitecore start up?
▸ WinDbg: Also part of Debugging Tools for Windows
▸ Can load dump files
▸ Supports managed applications (SOS)
▸ Common commands, sequence and
SOS initialization: https://bit.ly/2EGIiN2
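For orientation, a typical SOS session on a .NET Framework dump boils down to a short command sequence like the following (the thread number and object address are placeholders, not values from this talk):

```
$$ Load SOS from the same folder as the loaded CLR (.NET Framework)
.loadby sos clr
$$ List managed threads and any exceptions attached to them
!threads
$$ Switch to thread 114, then dump its managed stack with parameters
~114s
!clrstack -p
$$ Inspect a managed object at a given address
!do <address>
```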
13. Event Queue
▸ Event not marked as done until processing completed (on web)
select * from dbo.EventQueue where InstanceData like '%d5729bfe-e486-4a37-8e0d-d1af8738a6aa%'
delete from dbo.EventQueue where InstanceData like '%d5729bfe-e486-4a37-8e0d-d1af8738a6aa%'
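Besides searching InstanceData, it can help to check how far each instance has processed the queue. Sitecore keeps a per-instance stamp in the Properties table; a hedged sketch (the key format can vary between versions):

```sql
-- Each instance records the stamp of the last processed event under
-- an EQStamp_<InstanceName> key in the Properties table.
select [Key], [Value] from dbo.Properties where [Key] like 'EQStamp%'
```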
15. Summary
▸ Data structure expected as tree, but received graph
▸ Recursion not handled in code
▸ StackOverflowException
▸ Event not marked as completed
17. Memory Leak
▸ Noticed by Operations / Monitoring
▸ Related to Index Rebuilds
▸ Got worse over time (more items)
18. Gathering hang dumps
▸ Using adplus in hang mode
▸ Two separate dumps: before and after
.\adplus.exe -p 11452 -o c:\temp\crash-analysis\memory-leak -hang
24. Further Research & Sitecore Support
▸ Service resolved in computed index fields
▸ https://github.com/aspnet/DependencyInjection/issues/456
▸ Answer from Sitecore Support:
The thing is that some of the Sitecore functionality depends on the
current version of the "Microsoft.Extensions.DependencyInjection.dll" file.
Unfortunately, replacing the mentioned file with a newer version was not
deeply tested, so it could cause some breaking changes and unexpected
behavior.
So, I kindly ask you consider upgrading your Sitecore instance to Sitecore
9.1.
26. RabbitMQ processing stuck
▸ Customer noticed that items are missing
▸ Middleware logs looked fine
▸ Messages were still in RabbitMQ
▸ Nothing in Sitecore logs
27. Gathering some crash dumps
▸ At least 2, some minutes apart
.\adplus.exe -p 34084 -hang -o C:\Temp\crash-analysis\import-stuck
34. Summary
▸ Query did not use exact match
▹ Tab interpreted as space by Solr
▸ Input data not sanitized
▸ Method stuck in endless loop
▸ Implemented sanitization in Middleware and Sitecore
▸ Re-coded unique lookup logic
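The sanitization can be sketched like this (Python for illustration; the real implementation lived in the middleware and in Sitecore, and the function name is an assumption):

```python
import re

def sanitize_title(raw):
    # Collapse tabs, newlines and repeated spaces into a single space,
    # so the stored value and the Solr query term can no longer differ
    # only by whitespace.
    return re.sub(r"\s+", " ", raw or "").strip()

print(sanitize_title("Annual\tReport  2019"))  # → "Annual Report 2019"
```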
David Szöke
Application Engineer at Unic in Switzerland
This talk is about debugging techniques
Because there is always the possibility that you missed some edge case while developing, testing or in the automated tests, which could break your environment
I’ve prepared 3 real-world cases which I thought might be interesting to you
We are going to look at how we approached these issues, what tools we used and how we tracked down the root cause
Let’s start with the first one
After executing the command, go and grab a coffee. Depending on the size of your application and type of error, this might take some minutes
-pmn: defines the process name to be monitored. ADPlus will keep watching and attach as soon as a process with this name starts
We are also using the -FullOnFirst argument, which generates a full dump even if only a first-chance exception occurred.
-crash: Since the application is in fact crashing, we provide the -crash argument. Another option would be -hang, which we will cover in a later case
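Put together, an invocation for this scenario looks roughly like this (the process name and output path are placeholders, not the talk's actual command):

```
.\adplus.exe -crash -pmn w3wp.exe -FullOnFirst -o C:\Temp\crash-analysis
```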
After the dump has been generated, you will end up with something like in the screenshot
From here, we have several options: We can either directly go to hardcore mode and spin up windbg, or we can first run some automated analysis on the dump. Let’s start with the automated analysis first.
Microsoft provides a set of diagnostics tools called DebugDiag. Next to the analyzer, there is also a utility for gathering crash dumps. The nice thing is that it also provides a GUI, so if you didn’t like the previous CLI approach, you could use DebugDiag for that instead.
So let’s spin up DebugDiag Analysis and select our freshly generated crash dumps
DebugDiag will generate a report which can be displayed in the browser
In the .NET Threads Summary we can see that there is a StackOverflowException in thread 114
An alternative to DebugDiag is PostMortem. It’s a simple tool I wrote a few months ago to make crash dump analysis somewhat easier. You can clone it from my GitHub account.
Like DebugDiag, it will generate a report which you can display in the browser
It will not only tell us that there is a StackOverflowException, but also display the stack trace of the thread the exception happened in
Looking at the stack trace, we can see that there is clearly something going on in the ReferencedItemComputedField.GetRootId method
There was in fact a recursive call
User requirements:
Imported items can reference each other
We expected a tree, but got a graph
Fixed by implementing a circuit breaker: a list of visited items that is passed along with the recursion
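The fix can be sketched as follows (Python for illustration; the actual code was C# in ReferencedItemComputedField.GetRootId, and all names here are assumptions):

```python
# Hypothetical sketch of the circuit breaker: carry a set of visited
# item IDs through the recursion, so a cyclic reference graph cannot
# recurse until the stack overflows.
def get_root_id(item_id, references, visited=None):
    """references maps an item ID to the ID it points at (or None)."""
    if visited is None:
        visited = set()
    if item_id in visited:
        # Seen before: the "tree" is actually a graph -- stop here.
        return item_id
    visited.add(item_id)
    parent = references.get(item_id)
    return item_id if parent is None else get_root_id(parent, references, visited)

# A cycle of the kind that previously caused the StackOverflowException:
refs = {"a": "b", "b": "c", "c": "a"}
print(get_root_id("a", refs))  # → "a": the cycle is detected and recursion stops
```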
So we were able to fix the bug, but the system was still down and had to go up as soon as possible. Therefore, we had to find the conflicting item. Due to the fact that we had over 100’000 content items in 42 different language versions, we couldn’t just click through each item, but had to come up with a different solution.
Let’s enter windbg
It’s a multipurpose debugger by Microsoft. It was originally made for unmanaged applications, but Microsoft also provides a debugging extension called «Son Of Strike» allowing you to debug managed applications as well.
It’s more or less a CLI and has a pretty steep learning curve, but the more you use it, the more rewarding it’s going to get. Especially when you are able to track down and prove an issue.
So for starters, I created a short markdown file which lists some common commands and a way to initialize the SOS extensions.
With WinDbg open, we opened the dump of the StackOverflowException
WinDbg will automatically set you to the current thread where the exception occurred
We can now run the !clrstack -p command to display the stack trace, including the parameters
Input is highlighted in blue, important addresses are in yellow
Remote event only removed when processing complete
We were able to correlate the memory leaks to index rebuilds through monitoring tools and Sitecore logs
Unlike a crash dump, a hang dump is taken on demand from a running process: nothing has to crash first, we simply snapshot the current state
Switch to second dump
«Largest Size» is the memory the objects of a type occupy themselves; «Largest Retained Size» additionally counts everything that is kept alive only through those objects
It usually helps to compare these graphs with a dump of a vanilla Sitecore instance
<Explain similar retention>
Talk about _transientDisposables
We were able to confirm that the SampleService was in fact registered as transient
And indeed, the class also implements IDisposable.
And surely enough, it also allocates a byte array
We’ve also checked the references to the class and were able to confirm that it was resolved in a computed field
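The mechanism can be illustrated with a minimal container sketch (Python for illustration; the real container is Microsoft.Extensions.DependencyInjection, and the buffer size is an assumption):

```python
# Minimal illustration of the leak: a container that, like the affected
# DI version, keeps every transient disposable it creates so it can
# dispose them when the root provider is disposed -- which never
# happens while the site is running.
class Container:
    def __init__(self):
        self._transient_disposables = []  # grows for the provider's lifetime

    def resolve(self, factory):
        instance = factory()
        if hasattr(instance, "dispose"):
            # Tracked for later disposal, so never released while the
            # root provider is alive.
            self._transient_disposables.append(instance)
        return instance

class SampleService:
    def __init__(self):
        self.buffer = bytearray(1024 * 1024)  # like the byte[] on the slide

    def dispose(self):
        self.buffer = b""

container = Container()
for _ in range(100):  # e.g. resolutions during an index rebuild
    container.resolve(SampleService)
# 100 services (and their buffers) are still referenced:
print(len(container._transient_disposables))  # → 100
```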
The original problem was happening in an import job. I’ve ported the problem to an event handler, just for demo purposes and easier implementation
Again in hang mode
Note that there is also a -r parameter which can be used to take several dumps at a defined interval
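For example, to take three hang dumps 120 seconds apart (PID and path as in the earlier command; the counts are placeholders):

```
.\adplus.exe -hang -p 34084 -o C:\Temp\crash-analysis\import-stuck -r 3 120
```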
Run with multiple dumps in «CrashHangMode»
So if these were really Solr queries, they should show up in the search logs, right?
Next thing we did was to enable the search logs. By default, we had them disabled in the solution, as they were generating just too much data
Requirements
Ensure title is unique across all items
Use it while developing
Not much time investment is needed to get a good understanding of the tool