Review of Sorting Algorithms
- Selection Sort uses a priority queue P implemented with an unsorted sequence:
- Phase 1: the insertion of an item into P takes O(1) time; overall O(n)
- Phase 2: removing an item takes time proportional to the number of elements k in P, Q(k); overall Q(n2)
- Total time complexity: Q(n2)
- Insertion Sort is performed on a priority queue implemented as a sorted sequence:
- Phase 1: the first insertItem takes O(1), the second O(2), until the last insertItem takes O(n); overall O(n2).
- Phase 2: removing an item takes O(1) time; overall O(n).
- Total time complexity: O(n2)
- Heap Sort uses a priority queue implemented as a heap.
- insertItem and removeMinElement each take O(log k) where k is the number of elements in the heap at a given time.
- Phase 1: n elements inserted: O(n log n) time.
- Phase 2: n elements removed: O(n log n) time.
- Total time complexity: O(n log n)
- Divide and conquer is more than just a military strategy, it is also a method of algorithm design that has led to efficient algorithms such as merge-sort and quick-sort.
- This algorithmic approach has three distinct steps:
- Divide: If the input size is too large to deal with in a trivial (or nearly trivial) manner, divide the data into two or more disjoint subsets.
- Recurse: Use the same divide-and-conquer algorithm to solve the subproblems associated with the data subsets.
- Conquer: Take the solutions to these now-solved subproblems and "merge" them into the solution for the original problems.
- Algorithm:
- Divide: If S has at least two elements (nothing needs to be done if S has zero or one elements), remove all the elements from S and put them into two subsequences, S1 and S2, each containing about half of the elements of S.
- S1 contains the first én/2ù elements,
- S2 contains the remaining ën/2û elements.
- Recurse: Recursively sort subsequences S1 and S2.
- Conquer: Put back the elements into S by merging the now-sorted subsequences S1 and S2 into a sorted final sequence S.
Merge-Sort Tree
- A conceptual tool to visualize the execution of merge sort.
- Not a data structure.
- Take a binary tree T.
- Each node of T represents a recursive call of the merge-sort algorithm.
- We associate with each node v of T the set of input passed to the invocation that v represents.
- The children of node v are associated with the recursive calls on the subsequences of that node.
- The external nodes represent base cases and are associated with individual elements of S, on which no further recursion is required.
Merge-Sort Example
data:image/s3,"s3://crabby-images/31154/311541146e333e2dc3bf1e7a29d8954d5d38a476" alt=""
data:image/s3,"s3://crabby-images/b37fa/b37faf46694cee1c51d07cb1706525a59bf3a6f9" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/f1a59/f1a59bd2180bb91d134df75acb660d579a2bd024" alt=""
data:image/s3,"s3://crabby-images/c25de/c25debb8be3dd4f90b0d3d2f91515b2b0dda302d" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/3053e/3053e669244dc5d5fb1e0d78583a2b4fb834e2df" alt=""
data:image/s3,"s3://crabby-images/7015e/7015ec90371a191fe7f17a890b657d8259251b38" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/796c9/796c9bfe8d0b57cb608d20b729691a2acfc5e191" alt=""
data:image/s3,"s3://crabby-images/ba3a5/ba3a50900edd692adee411cbbc62abf8aa527249" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/d039b/d039bff7f41446535abc18e1a8eeb08840efc359" alt=""
data:image/s3,"s3://crabby-images/b50d0/b50d07d60ed80370363f5fe9d5ea208d0663ce7e" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/d7ef2/d7ef2e214edaad034642ce962ae5f778b15d17f7" alt=""
data:image/s3,"s3://crabby-images/e4660/e46604472e3e67fca2aba521c29e2aa3e3932aa1" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/0db77/0db771d923c4ea28cc002230ac443610edcb6e60" alt=""
data:image/s3,"s3://crabby-images/294c6/294c6fb7e94d0e70ef43b3c411654ee965f2e972" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/15f4c/15f4cb8e46be4563cea7536e68b098d0eaff84f6" alt=""
data:image/s3,"s3://crabby-images/75fff/75fff005b03c59cd7a1dff1b698d2f50a20f796e" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/dcb79/dcb79f36038ea7656190051f2848dedaaa2bab8d" alt=""
data:image/s3,"s3://crabby-images/8bc76/8bc76dd7465476a92bfb22a7d2a59d7f8cf2027e" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/79d3e/79d3e60871c5807d66d6d1458feb4821375ac153" alt=""
data:image/s3,"s3://crabby-images/04030/04030e23ceabca9abc84e453814e4d9a84d02b3e" alt=""
Merge-Sort Example (cont.)
data:image/s3,"s3://crabby-images/01c45/01c459770bfd17ed4e9db0f96442b07f5bb91745" alt=""
data:image/s3,"s3://crabby-images/a49a0/a49a0215358e2bc01db5984ce41ef4ebd432dae0" alt=""
Merging Two Sequences
- Pseudo-Code for the "conquer" step
Algorithmmerge(S1, S2, S)
- Input: Sequences S1
and S2, on whose elements a total order relation is defined
and which is sorted in nonedecreasing order, and an empty sequence S.Output: Sequence S containing the union
of the elements from S1 and S2 sorted
in nondecreasing order; sequences S1 and S2
become empty as a result of the execution of this algorithm.
while S1 is not empty and S2 is not empty do
if S1.first().element()£S2.first().element() then
// move the first element of S1 to the end of S S.insertLast(S1.remove(S1.first()))
else // move the last element of S2 to the end of S S.insertLast(S2.remove(S2.first()))
// move the remaining elements (if any) of S1 to Swhile S1 is not empty do
S.insertLast(S1.remove(S1.first()))
// move the remaining elements (if any) of S2 to S
while S2 is not empty do
S.insertLast(S2.remove(S2.first()))
a)
data:image/s3,"s3://crabby-images/0d227/0d227f1755b1f207f14e18407daa74e849632653" alt=""
b)
data:image/s3,"s3://crabby-images/26f4b/26f4bc06df34e78d5f404b701d1b80f5e4513604" alt=""
Merge Example (cont.)
c)
data:image/s3,"s3://crabby-images/a9864/a9864c6b5aace004858c656fc8c1950252082eec" alt=""
d)
data:image/s3,"s3://crabby-images/8c832/8c832ad78813318d5a86b6d76c28ec24746fe43e" alt=""
Merge Example (cont.)
e)
data:image/s3,"s3://crabby-images/ed160/ed160a02c4b3f44925764f6c8971434f842f143a" alt=""
f)
data:image/s3,"s3://crabby-images/af75e/af75e50c04428b23aa6d1a2515495c6dbda5647e" alt=""
Merge Example (cont.)
g)
data:image/s3,"s3://crabby-images/03197/0319745b7e05e2e3133590c2c0dbdac90de939e5" alt=""
h)
data:image/s3,"s3://crabby-images/04e71/04e71bddd35882b31fc06c351de7dd08b1991f0d" alt=""
Merge Example (cont.)
i)
data:image/s3,"s3://crabby-images/309cb/309cbad93fa5e710baef8740954d3ab3fa3d7d63" alt=""
Analysis of Merge-Sort
- Assume that n is a power of 2. The same result can be shown for general n.
-
Proposition 1: The merge-sort
tree associated with the execution of merge-sort on a sequence of n
elements has a height of
.
- Informal justification: If n is a power of 2, log n equals the number of times that a sequence of size n can be divided in half.
- Proposition 2: A merge-sort algorithm sorts a sequence of size n in O(n log n) time.
- Justification:
- Assume that access, insertion, and deletion operations on the first elements of S, S1, and S2 are O(1).
- When considering the time spent at node v in the recursive tree, only consider the time spent at that node exclusive of the time spent at node v’s children. Then the time spent at node v is proportional to the size of its sequence.
- Let i be the depth of node v. The size of node v’s sequence is n/2i Þ The cost of node v is O(n/2i).
- T has exactly 2i nodes at depth i. The time spent in all nodes at depth i is then O(2in/2i) = O(n).
- Since the tree has O(log n) height, the total cost of merge-sort is O(n log n).
Interface SortObject
public interface
SortObject {
// sort sequence S in nondecreasing order using comparator c
publicvoid sort(Sequence S, Comparator c);
}
// sort sequence S in nondecreasing order using comparator c
publicvoid sort(Sequence S, Comparator c);
}
public class
ListMergeSort implements SortObject {
publicvoid sort(Sequence S, Comparator c) {
int n = S.size();
// a sequence with 0 or 1 element is already sorted
if (n < 2) return;
// divide
Sequence S1 = (Sequence)S.newContainer();
for (int i=1; i <= (n+1)/2; i++) {
S1.insertLast(S.remove(S.first()));
}
Sequence S2 = (Sequence)S.newContainer();
for (int i=1; i <= n/2; i++) {
S2.insertLast(S.remove(S.first()));
}
// recur
sort(S1,c);
sort(S2,c);
//conquer
merge(S1,S2,c,S);
}
publicvoid sort(Sequence S, Comparator c) {
int n = S.size();
// a sequence with 0 or 1 element is already sorted
if (n < 2) return;
// divide
Sequence S1 = (Sequence)S.newContainer();
for (int i=1; i <= (n+1)/2; i++) {
S1.insertLast(S.remove(S.first()));
}
Sequence S2 = (Sequence)S.newContainer();
for (int i=1; i <= n/2; i++) {
S2.insertLast(S.remove(S.first()));
}
// recur
sort(S1,c);
sort(S2,c);
//conquer
merge(S1,S2,c,S);
}
Class ListMergeSort (cont.)
public void merge(Sequence S1, Sequence
S2, Comparator c, Sequence S) {
while(!S1.isEmpty() && !S2.isEmpty()) {
if(c.isLessThanOrEqualTo(S1.first().element(),S2.first().element())) {
S.insertLast(S1.remove(S1.first()));
}
else {
S.insertLast(S2.remove(S2.first()));
}
}
if(S1.isEmpty()) {
while(!S2.isEmpty()) {
S.insertLast(S2.remove(S2.first()));
}
}
if(S2.isEmpty()) {
while(!S1.isEmpty()) {
S.insertLast(S1.remove(S1.first()));
}
}
}
}
while(!S1.isEmpty() && !S2.isEmpty()) {
if(c.isLessThanOrEqualTo(S1.first().element(),S2.first().element())) {
S.insertLast(S1.remove(S1.first()));
}
else {
S.insertLast(S2.remove(S2.first()));
}
}
if(S1.isEmpty()) {
while(!S2.isEmpty()) {
S.insertLast(S2.remove(S2.first()));
}
}
if(S2.isEmpty()) {
while(!S1.isEmpty()) {
S.insertLast(S1.remove(S1.first()));
}
}
}
}
- Quick-sort
- is another divide-and-conquer sorting algorithm,
- can be performed "in-place" meaning that only a constant amount of memory is needed in addition to the memory required for the objects being sorted, and
- is probably the most commonly used sorting algorithm.
- High-level description:
- L, holds the elements of S £x.
- E, the pivot element (single-element sequence)
- G, holds the elements of S ³x.
data:image/s3,"s3://crabby-images/4cfac/4cfaccef3be18226b2334d3c0e9ff307223675d1" alt=""
data:image/s3,"s3://crabby-images/e5861/e58617b5386cebb0cce938f09b15a4f6e591b8ef" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/0d979/0d979f6909e9d0755372eccd5f2e1c1aa2b3d21e" alt=""
data:image/s3,"s3://crabby-images/d7f49/d7f49c7857becff5846649c4a4da6f0002c141cf" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/a0e50/a0e507e06539c87f4459d77e6ee31a47fcd8d5fb" alt=""
data:image/s3,"s3://crabby-images/47143/47143b702e867410d70d89de3928ec0a9a660b9e" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/be28d/be28d1f11c1209c074ffcc0bf112664577591246" alt=""
data:image/s3,"s3://crabby-images/e95f3/e95f352b0ebc4c8f4debfd1d709e85e5ad44c1f5" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/a76f3/a76f32a64976d79488658e4d48b5506182f83c62" alt=""
data:image/s3,"s3://crabby-images/d1ce8/d1ce8110dca9ba4dc00b414247aae1223e4d0088" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/f74f1/f74f16cf80f1360a9d68a1c8b4f49fe762d9501d" alt=""
data:image/s3,"s3://crabby-images/287e9/287e9a7509df9e2ddffa178965a724ef5afdd8ec" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/7e0bb/7e0bbbe4efde5465006a65f7c5b15f28f5655c20" alt=""
data:image/s3,"s3://crabby-images/702c5/702c52ddee1be1881ea1f4bb92d5a3ba296a1004" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/c593e/c593ef0310605be907a00afbaf394558edfa0f6b" alt=""
data:image/s3,"s3://crabby-images/a8caa/a8caa2d4174f4e3d17c06ae50d91fa4a3a9a8009" alt=""
Quick-Sort Tree (cont.)
data:image/s3,"s3://crabby-images/2fe68/2fe686301eb982a9d2142c54bb700827d8667481" alt=""
data:image/s3,"s3://crabby-images/46e39/46e390cad57f0a71ae9c89150f165e73dd529d23" alt=""
Quick-Sort Divide Step
- In-place quick-sort: The in-place algorithm uses array indices l and r to partitions the array-based sequence into subsequences L, E, G.
- l scans the sequence from the left, and r from the right.
data:image/s3,"s3://crabby-images/9c997/9c997d6b2d3e5014c7301cc41c2f9d666967dcc0" alt=""
- A swap is performed when l is at an element larger than the pivot and r is at one smaller than the pivot.
data:image/s3,"s3://crabby-images/4f84e/4f84e13dae6cdade31c5cc9502700a4b9880b7f1" alt=""
Quick-Sort Partition Step (cont.)
data:image/s3,"s3://crabby-images/593a5/593a55323d1bb207eac2e6359f5d8aace7bdd7eb" alt=""
- A final swap with the pivot completes the divide (i.e., partition) step.
data:image/s3,"s3://crabby-images/50916/50916a085e87e9c181db78cc280bb6d62d748696" alt=""
Analysis of Quick-Sort
- Consider a quick-sort tree T:
- Let si(n) denote the sum of the input sizes of the nodes at depth i in T.
- We know that s0(n) = n since the root of T is associated with the entire input set.
- Also, s1(n) = n-1, since the pivot is not propagated.
- Thus, either s2(n) = n – 3, or, if one of the subsequences at level 1 has zero input size, s2(n) = n - 2 in the worst case.
- In general, si(n) = n – i in the worst case.
- The worst-case running time of quick-sort is then:
data:image/s3,"s3://crabby-images/1aebf/1aebf078200c7a9f82af449be249b9ecf7b2ae5a" alt=""
- The worst-case running time of quick-sort is O(n2). This occurs when the input sequence is sorted. Nearly worst-case performance occurs on nearly sorted sequences.
- Now to look at the best-case running time of quick-sort:
- We can see that quick-sort behaves optimally if, whenever a sequence S is divided into subsequences L and G, the subsequences are of equal size.
- More precisely,
data:image/s3,"s3://crabby-images/03c07/03c077e220535d64fa0d85bfef3a427ae4a07add" alt=""
- The height of the tree is the maximum value of i for which
data:image/s3,"s3://crabby-images/ef2ac/ef2acbff9e4abbb6c323882eca000e7b3de0aef7" alt=""
- That is to say, the height of the quick-sort tree in the best case is h = log n, and the best-case performance of quick-sort is O(n log n).
- The main drawback to quick-sort is that it achieves its worst-case time complexity on data sets that are common in practice, namely sequences that are already sorted (or mostly sorted).
- To avoid this, modifications to quick-sort are available that are more careful about choosing the pivot.
- One such modification chooses a random element of the sequence. This modified version is known as randomized quick-sort.
- The expected time of a randomized quick-sort on a sequence of size n is O(n log n).
- Justification: see next page.
Cost of quick-sort on n items = partition step
+ cost of 2 recursive calls
Assume that the size of any subsequence from 0 to n-1
is equally likely.
data:image/s3,"s3://crabby-images/e1ba1/e1ba164b15f36ba72f3dfe5e3e202e51ec6942d3" alt=""
data:image/s3,"s3://crabby-images/b7176/b7176b68d81bdaff615eb8753e65ffd739d13b17" alt=""
data:image/s3,"s3://crabby-images/b3766/b376670be0d6407aeb0669a085fd842f6ad26c43" alt=""
which implies that
data:image/s3,"s3://crabby-images/dd746/dd746145b1576b61120b96203924684d02cd85c3" alt=""
Subtract the above two equations
data:image/s3,"s3://crabby-images/e6a1d/e6a1dc563891fbf587561be55230148f1807ceb8" alt=""
Rearrange and drop insignificant constant terms
data:image/s3,"s3://crabby-images/61c24/61c2400e65763c4bcc6ee4ff7bf3ef854f51b4e3" alt=""
Average-Case Analysis of Quick-Sort (cont.)
Divide by n(n+1)
data:image/s3,"s3://crabby-images/90e3f/90e3f1774201aa1752873504b041d16e753a5243" alt=""
Solve the recursive relationship
data:image/s3,"s3://crabby-images/2efc9/2efc9a735c6887818672e001d896ff56c4212aae" alt=""
where Hn is the nth
Harmonic number which can be bounded above and below by
.
data:image/s3,"s3://crabby-images/6a285/6a285095ebdfdd7019edc66d7521f345c3cd7028" alt=""
Thus
.
data:image/s3,"s3://crabby-images/6e9fa/6e9fa6208345df14734b7f8bc142b9a53059ab35" alt=""
Selection and Order Statistics
Order Statistic - Identify
a single element relative to its rank in the collection.
example: median, element
that would be at rank
data:image/s3,"s3://crabby-images/47ba4/47ba4b77cd8e339ea43cd18a7a273868db9d28d5" alt=""
Selection problem – select
the kth smallest element from a collection of n similar elements.
-
Sort the collection:
- Randomized quick-select algorithm
- O(n2) worst-case performance, but
- O(n) expected performance
- Similar to randomized quick-sort. Chooses an arbitrary element from the collection as the pivot and recursively calls itself.
Algorithm quickSelect(S,k):
Input: Sequence S of n comparable elements Input: integer![]()
Output: The kth smallest element of S if n = 1 then
return the (first) element of S
Pick a random integer r in the range [0,n-1]
Let x be the element of S at rank r
Remove all the elements from S and put them into three sequences:
- L, storing the elements in S less than x
- E, storing the elements in S equal to x
- G, storing the elements in S greater than x
if k £L.size()then
quickSelect(L,k)
else if k £L.size() + E.size() then
// each element in E is equal to x
return x
else
// note the new selection parameter
quickSelect(G,k-L.size()-E.size())
No comments:
Post a Comment