Like this presentation? Why not share!

# .Net Collection Classes Deep Dive - Rocksolid Tour 2013

## on Jan 23, 2013

• 824 views

### Views

Total Views
824
Views on SlideShare
824
Embed Views
0

Likes
0
1
0

No embeds

### Report content

• Comment goes here.
Are you sure you want to
• Why should we care?It’s all about performance.Performance is the most important thing… apart from everything elsePerformance is like currency, the more you have, the more stuff you can buy.

## .Net Collection Classes Deep Dive - Rocksolid Tour 2013Presentation Transcript

• Collection Classes Deep Dive By Gary Short Head of Gibraltar Labs Gibraltar Software 1
• Introduction• Gary Short• Head of Gibraltar Labs• C# MVP• @garyshort• gary.short@gibraltarsoftare.com• facebook.com/TheOtherGaryShort 2
• 4
• List<T> Demo 6
• What we Learned• Don’t add elements in a loop• Add causes capacity growths• Capacity growths uses Array.Copy()• Array.Copy() is a O(n) operation• O(n) is sloooooowwwwwww. • Use AddRange() instead• Or set “large enough” initial capacity. 7
• How Slow is Slow? 8
• What About Removing Stuff? 10
• Demo 11
• What we LearnedPrefer RemoveAt() as there’s no IndexOf() step 12
• List<T> - Sorting• Uses QuickSort under the hood• Fastest general purpose sort algorithm• O(n log n) in best case• O(n log n) in average case• Though worst case is O(n^2)  13
• Performance: O(n log n) Vs O(n^2) 120 100 80Effort 60 O(n log n) O(n^2) 40 20 0 1 2 3 4 5 6 7 8 9 10 Elements to be Sorted
• QuickSort Demo 15
• So What is the Worst Case?• If the list is already sorted – First partition has lower = 0, upper = n – Then calls Partition(n-1); – This happens a further n-2 times 16
• Can we Mitigate the Worst Case?• Median of Three – Take an element from the “top” of the array – Take an element from the “middle” of the array – Take an element from the “bottom” of the array – Find the median value of the three – Pivot on the median• Let’s see if Microsoft uses this algorithm. 18
• What if we Need Fast Add, Insert & Remove? 20
• LinkedList<T>• Double linked – Each item points to the previous and next items – This means it’s super fast • Add, insert and remove are all O(1) operations 21
• Demo 22
• What if we Need Fast Lookups? 24
• Dictionary<TKey, TValue>• Performance depends on key.GetHashCode() – Hash codes must be evenly distributed across int • If two keys return hashes that give the same index – Dictionary must look for nearest free location to store item – Must search later to return the item – This hurts performance – Use your own type, then this is on you.  25
• Dictionary<TKey, TValue>• Objects used as keys must also implement IEquatable.Equals()• Or override Equals()• Why? – Different keys may return the same hashcode – Equals() is used by the dictionary comparing keys – So you must ensure the following • If A.Equals(B) then A.HashCode() and B.HashCode() return the same HashCode() • Override Equals() but not GetHashCode() == compile error. 26
• Disadvantage: one value per key 27
• What if I Need Multiple Values per Key? 28
• Lookup<TKey, TElement> Demo 29
• Concurrent Collections 30
• Types of Concurrent Collections• ConcurrentBag<T>• ConcurrentDictionary<T>• ConcurrentQueue<T>• ConcurrentStack<T>• OrderablePartitioner<T>• BlockingCollection<T>. 31
• Key Characteristics• New .Net 4.0• Guards against multi-thread collection conflicts• Implements IProducerConsumerCollections<T> – TryAdd() • Tries to add item to collection returns success bool – TryTake() • Tries to remove and return item returns success bool – Returns the item in an out param.• Always check the return value before moving on. 32
• Do I Have To Check Every Time?!• BlockingCollection<T> – Blocks and waits until task completes – Uses Add() and Take() methods • Block the thread and wait until task completes • Add() has an overload to pass a CancellationToken • Add() may also block if bounding capacity was used. 33
• But I Don’t Want it to Wait For Ever!• So we don’t want to wait forever• Nor do we want to cancel the Add() from outside• TryAdd() and TryTake() are offered too• Where you can specify a timeout. 34
• Summary• List is a good general purpose collection – Construct to size if possible – Construct to upper threshold then trim – Prefer AddRange() over Add() – Be aware of “Quicksort Killers”• Use LinkedList if you need fast insert/remove• Use Dictionary if you need fast lookup• Use Lookup if you need multi values• Use concurrent collections for thread safety. 35
• Questions• gary.short@gibraltarsoftware.com• @garyshort• Facebook.com/TheOtherGaryShort 36