Arrays, Lists & Trees - Codes
Arrays, Lists & Trees - Codes
How do you reverse a singly linked list? How do you reverse a doubly linked list? Write a C program to
do the same.
This is THE most frequently asked interview question. The most! Singly linked lists. Here are a few C programs to
reverse a singly linked list.
Method1 (Iterative)
#include <stdio.h>
// Variables
typedef struct node
{
int value;
struct node *next;
}mynode;
// Functions
void add(int value);
void iterative_reverse();
void print_list();
//Print it
print_list();
// Reverse it.
iterative_reverse();
//Print it again
print_list();
return(0);
}
head = p;
}
if(head==(mynode *)0)
{
head=temp;
tail=temp;
}
else
{
tail->next=temp;
tail=temp;
}
}
#include <stdio.h>
// Variables
typedef struct node
{
int value;
struct node *next;
}mynode;
// Globals.
mynode *head, *tail, *temp;
// Functions
void add(int value);
mynode* reverse_recurse(mynode *root);
void print_list();
//Print it
print_list();
// Reverse it.
if(head != (mynode *)0)
{
temp = reverse_recurse(head);
temp->next = (mynode *)0;
}
//Print it again
print_list();
return(0);
}
if(head==(mynode *)0)
{
head=temp;
tail=temp;
}
else
{
tail->next=temp;
tail=temp;
}
}
#include <stdio.h>
// Variables
typedef struct node
{
int value;
struct node *next;
}mynode;
// Functions
void add(mynode **head, mynode **tail, int value);
mynode* reverse_recurse(mynode *current, mynode *next);
void print_list(mynode *);
int main()
{
mynode *head, *tail;
head=(mynode *)0;
//Print it
print_list(head);
// Reverse it.
head = reverse_recurse(head, (mynode *)0);
//Print it again
print_list(head);
getch();
return(0);
}
if(current==(mynode *)0)
{
return((mynode *)0);
}
ret = (mynode *)0;
if (current->next != (mynode *)0)
{
ret = reverse_recurse(current->next, current);
}
else
{
ret = current;
}
current->next = next;
return ret;
}
// Function to add new nodes to the linked list.
// Takes pointers to pointers to maintain the
// *actual* head and tail pointers (which are local to main()).
if(*head==(mynode *)0)
{
*head=temp1;
*tail=temp1;
}
else
{
for(temp2 = *head; temp2->next!= (mynode *)0; temp2=temp2->next);
temp2->next = temp1;
*tail=temp1;
}
}
This is really easy, just keep swapping the prev and next pointers and at the end swap the head and the tail:)
#include<stdio.h>
#include<ctype.h>
int main()
{
head=NULL;
tail=NULL;
add_node(1);
add_node(2);
add_node(3);
add_node(4);
add_node(5);
print_list();
reverse();
print_list();
return(1);
if(head == NULL)
{
printf("\nAdding a head pointer\n");
head=temp;
tail=temp;
temp->value=value;
}
else
{
for(cur=head;cur->next!=NULL;cur=cur->next);
cur->next=temp;
temp->prev=cur;
temp->value=value;
tail=temp;
}
void print_list()
{
mynode *temp;
printf("\n--------------------------------\n");
for(temp=head;temp!=NULL;temp=temp->next)
{
printf("\n[%d]\n",temp->value);
}
void reverse()
{
mynode *cur, *temp, *save_next;
if(head==tail)return;
if(head==NULL || tail==NULL)return;
for(cur=head;cur!=NULL;)
{
printf("\ncur->value : [%d]\n",cur->value);
temp=cur->next;
save_next=cur->next;
cur->next=cur->prev;
cur->prev=temp;
cur=save_next;
}
temp=head;
head=tail;
tail=temp;
}
Given only a pointer to a node to be deleted in a singly linked list, how do you delete it?
The solution to this is to copy the data from the next node into this node and delete the next node!. Ofcourse this
wont work if the node to be deleted is the last node. Mark it as dummy in that case. If you have a Circular linked
list, then this might be all the more interesting. Try writing your own C program to solve this problem. Having a
doubly linked list is always better.
How do you sort a linked list? Write a C program to sort a linked list.
This is a very popular interview question, which most people go wrong. The ideal solution to this problem is to
keep the linked list sorted as you build it. Another question on this website teaches you how to insert elements
into a linked list in the sorted order. This really saves a lot of time which would have been required to sort it.
The general idea is to decide upon a sorting algorithm (say bubble sort). Then, one needs to come up with
different scenarios to swap two nodes in the linked list when they are not in the required order. The different
scenarios would be something like
1. When the nodes being compared are not adjacent and one of them is the first node.
2. When the nodes being compared are not adjacent and none of them is the first node
3. When the nodes being compared are adjacent and one of them is the first node.
4. When the nodes being compared are adjacent and none of them is the first node.
One example bubble sort for a linked list goes like this (working C code!)....
#include<stdio.h>
#include<ctype.h>
print_list("myList(BEFORE)", head);
head = bubbleSort(head, count);
print_list("myList(AFTER) ", head);
getch();
return(0);
if(*head == NULL)
{
*head=temp;
temp->value=value;
}
else
{
for(cur=*head;cur->next!=NULL;cur=cur->next);
cur->next=temp;
temp->prev=cur;
temp->value=value;
}
*count = *count + 1;
}
void print_list(char *listName, struct node *head)
{
mynode *temp;
printf("NULL\n");
}
As you can see, the code becomes quite messy because of the pointer logic. Thats why I have not elaborated too
much on the code, nor on variations such as sorting a doubly linked list. You have to do it yourself once to
understand it.
#include<stdio.h>
#include<ctype.h>
a = head1;
b = head2;
c = (struct node *)NULL;
newHead = (struct node*)NULL;
if(a==NULL)return(b);
else if(b==NULL)return(a);
if(c == NULL)
{
c = a;
}
else
{
c->next = a;
c = c->next;
}
a = a->next;
b = b->next;
}
// Make sure the new head is set...
if(newHead == NULL)
newHead = c;
if(a==NULL)
c->next = b;
else if(b==NULL)
c->next = a;
return(newHead);
if (source==NULL || source->next==NULL)
{
// length < 2 cases
*frontRef = source;
*backRef = NULL;
}
else
{
slow = source;
fast = source->next;
// Advance 'fast' two nodes, and advance 'slow' one node
while (fast != NULL)
{
fast = fast->next;
if (fast != NULL)
{
slow = slow->next;
fast = fast->next;
}
}
// 'slow' is before the midpoint in the list, so split it in two
// at that point.
*frontRef = source;
*backRef = slow->next;
slow->next = NULL;
}
}
if(*head == NULL)
{
*head=temp;
temp->value=value;
}
else
{
for(cur=*head;cur->next!=NULL;cur=cur->next);
cur->next=temp;
temp->prev=cur;
temp->value=value;
}
}
printf("NULL\n");
The code to merge two already sorted sub-linked lists into a sorted linked list could be either iterative or
recursive. You already saw the iterative version above. Here is a recursive version of the same...
Recursive solution to merge two already sorted linked lists into a single linked list
if (a==NULL) return(b);
else if (b==NULL) return(a);
return(result);
}
Also, see how the splitLLInto2() function uses the same technique used to find the middle of a linked list to split
a linked list into two without having to keep a count of the number of nodes in the linkes list!
Here is another solution (not that great, though) to split a linked list into two. It used the count of the number of
nodes to decide where to split
if (len < 2)
{
*frontRef = source;
*backRef = NULL;
}
else
{
int hopCount = (len-1)/2;
Using recursive stack space proportional to the length of a list is not recommended. However, the recursion in
this case is ok ? it uses stack space which is proportional to the log of the length of the list. For a 1000 node list,
the recursion will only go about 10 deep. For a 2000 node list, it will go 11 deep. If you think about it, you can
see that doubling the size of the list only increases the depth by 1.
struct node {
int value;
struct node *next;
};
typedef struct node *mynode;
typedef struct {
int value;
mynode next;
} *mynode;
The typedef is not defined at the point where the "next" field is declared.
struct node {
int value;
struct node next;
};
typedef struct node mynode;
You can only have pointer to structures, not the structure itself as its recursive!
Here is a C program which implements a generic linked list. This is also one of the very popular interview
questions thrown around. The crux of the solution is to use the void C pointer to make it generic. Also notice
how we use function pointers to pass the address of different functions to print the different generic data.
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
struct check {
int i;
char c;
double d;
} chk[] = { { 1, 'a', 1.1 },
{ 2, 'b', 2.2 },
{ 3, 'c', 3.3 } };
int main(void)
{
char c[] = { 'a', 'b', 'c', 'd' };
int i[] = { 1, 2, 3, 4 };
char *str[] = { "hello1", "hello2", "hello3", "hello4" };
printf("Printing characters:");
print(list1, printchar);
printf(" : done\n\n");
printf("Printing integers:");
print(list2, printint);
printf(" : done\n\n");
printf("Printing strings:");
print(list3, printstr);
printf(" : done\n\n");
printf("Printing composite:");
print(list4, printcomp);
printf(" : done\n");
return 0;
}
One way is to reverse the data in the nodes without changing the pointers themselves. One can also create a new
linked list which is the reverse of the original linked list. A simple C program can do that for you. Please note
that you would still use the "next" pointer fields to traverse through the linked list (So in effect, you are using
the pointers, but you are not changing them when reversing the linked list).
How would you detect a loop in a linked list? Write a C program to detect a loop in a linked list.
There are multiple answers to this problem. Here are a few C programs to attack this problem.
Have a double loop, where you check the node pointed to by the outer loop, with every node of the inner loop.
while(current->next != NULL)
{
mynode *temp = head;
while(temp->next != NULL && temp != current)
{
if(current->next == temp)
{
printf("\nFound a loop.");
return current;
}
temp = temp->next;
}
current = current->next;
}
return NULL;
}
Visited flag
Have a visited flag in each node of the linked list. Flag it as visited when you reach the node. When you reach a
node and the flag is already flagged as visited, then you know there is a loop in the linked list.
Fastest method
Have 2 pointers to start of the linked list. Increment one pointer by 1 node and the other by 2 nodes. If there's a
loop, the 2nd pointer will meet the 1st pointer somewhere. If it does, then you know there's one.
p=head;
q=head->next;
// No loop.
How do you find the middle of a linked list? Write a C program to return the middle of a linked list?
Here are a few C program snippets to give you an idea of the possible solutions.
#include<stdio.h>
#include<ctype.h>
print_list("myList", head);
getTheMiddle(head);
getch();
return(0);
}
if(q!=NULL)
{
while((q->next)!=NULL && (q->next->next)!=NULL)
{
p=(p!=(mynode *)NULL?p->next:(mynode *)NULL);
q=(q!=(mynode *)NULL?q->next:(mynode *)NULL);
q=(q!=(mynode *)NULL?q->next:(mynode *)NULL);
}
printf("The middle element is [%d]",p->value);
}
}
if(*head == NULL)
{
*head=temp;
temp->value=value;
}
else
{
for(cur=*head;cur->next!=NULL;cur=cur->next);
cur->next=temp;
temp->prev=cur;
temp->value=value;
}
}
printf("NULL\n");
}
Here p moves one step, where as q moves two steps, when q reaches end, p will be at the middle of the linked
list.
Method2(Uses a counter)
#include<stdio.h>
#include<ctype.h>
print_list("myList", head);
middle = getTheMiddle(head);
printf("\nMiddle node -> [%d]\n\n", middle->value);
getch();
return(0);
}
if(*head == NULL)
{
*head=temp;
temp->value=value;
}
else
{
for(cur=*head;cur->next!=NULL;cur=cur->next);
cur->next=temp;
temp->prev=cur;
temp->value=value;
}
}
printf("NULL\n");
In a similar way, we can find the 1/3 th node of linked list by changing (i%2==1) to (i%3==1) and in the same
way we can find nth node of list by changing (i%2==1) to (i%n==1) but make sure ur (n<=i).
If you are using C language to implement the heterogeneous linked list, what pointer type will you use?
The heterogeneous linked list contains different data types in its nodes and we need a link, pointer to connect
them. It is not possible to use ordinary pointers for this. So we go for void pointer. Void pointer is capable of
storing pointer to any type as it is a generic pointer type.
Check out the C program to implement a Generic linked list in the same FAQ.
How to compare two linked lists? Write a C program to compare two linked lists.
Another way is to do it on similar lines as strcmp() compares two strings, character by character (here each node
is like a character).
How to create a copy of a linked list? Write a C program to create a copy of a linked list.
Check out this C program which creates an exact copy of a linked list.
Before looking at the answer, try writing a simple C program (with a for loop) to do this. Quite a few people get
this wrong.
This is the wrong way to do it
If you are thinking why the above piece of code is wrong, note that once you free the listptr node, you cannot do
something like listptr = listptr->next!. Since listptr is already freed, using it to get listptr->next is illegal and can
cause unpredictable results!
After doing this, make sure you also set the head pointer to NULL!
The answer is ofcourse, you can write a C program to do this. But, the question is, do you really think it will be
as efficient as a C program which does a binary search on an array?
Do you know what exactly makes the binary search on an array so fast and efficient? Its the ability to access any
element in the array in constant time. This is what makes it so fast. You can get to the middle of the array just by
saying array[middle]!. Now, can you do the same with a linked list? The answer is No. You will have to write
your own, possibly inefficient algorithm to get the value of the middle node of a linked list. In a linked list, you
loosse the ability to get the value of any node in a constant time.
One solution to the inefficiency of getting the middle of the linked list during a binary search is to have the first
node contain one additional pointer that points to the node in the middle. Decide at the first node if you need to
check the first or the second half of the linked list. Continue doing that with each half-list.
Write a C program to return the nth node from the end of a linked list.
Here is a solution which is often called as the solution that uses frames.
Suppose one needs to get to the 6th node from the end in this LL. First, just keep on incrementing the first
pointer (ptr1) till the number of increments cross n (which is 6 in this case)
STEP 1 : 1(ptr1,ptr2) -> 2 -> 3 -> 4 -> 5 -> 6 -> 7 -> 8 -> 9 -> 10
STEP 2 : 1(ptr2) -> 2 -> 3 -> 4 -> 5 -> 6(ptr1) -> 7 -> 8 -> 9 -> 10
Now, start the second pointer (ptr2) and keep on incrementing it till the first pointer (ptr1) reaches the end of the
LL.
STEP 3 : 1 -> 2 -> 3 -> 4(ptr2) -> 5 -> 6 -> 7 -> 8 -> 9 -> 10 (ptr1)
So here you have!, the 6th node from the end pointed to by ptr2!
struct node
{
int data;
struct node *next;
}mynode;
if(!head)
{
return(NULL);
}
ptr1 = head;
ptr2 = head;
count = 0;
while(count < n)
{
count++;
if((ptr1=ptr1->next)==NULL)
{
//Length of the linked list less than n. Error.
return(NULL);
}
}
while((ptr1=ptr1->next)!=NULL)
{
ptr2=ptr2->next;
}
return(ptr2);
}
How would you find out if one of the pointers in a linked list is corrupted or not?
This is a really good interview question. The reason is that linked lists are used in a wide variety of scenarios
and being able to detect and correct pointer corruptions might be a very valuable tool. For example, data blocks
associated with files in a file system are usually stored as linked lists. Each data block points to the next data
block. A single corrupt pointer can cause the entire file to be lost!
Discover and fix bugs when they corrupt the linked list and not when effect becomes visible in some
other part of the program. Perform frequent consistency checks (to see if the linked list is indeed
holding the data that you inserted into it).
It is good programming practice to set the pointer value to NULL immediately after freeing the
memory pointed at by the pointer. This will help in debugging, because it will tell you that the object
was freed somewhere beforehand. Keep track of how many objects are pointing to a object using
reference counts if required.
Use a good debugger to see how the datastructures are getting corrupted and trace down the problem.
Debuggers like ddd on linux and memory profilers like Purify, Electric fence are good starting points.
These tools should help you track down heap corruption issues easily.
Avoid global variables when traversing and manipulating linked lists. Imagine what would happen if a
function which is only supposed to traverse a linked list using a global head pointer accidently sets the
head pointer to NULL!.
Its a good idea to check the addNode() and the deleteNode() routines and test them for all types of
scenarios. This should include tests for inserting/deleting nodes at the front/middle/end of the linked
list, working with an empty linked list, running out of memory when using malloc() when allocating
memory for new nodes, writing through NULL pointers, writing more data into the node fields then
they can hold (resulting in corrupting the (probably adjacent) "prev" and "next" pointer fields), make
sure bug fixes and enhancements to the linked list code are reviewed and well tested (a lot of bugs
come from quick and dirty bug fixing), log and handle all possible errors (this will help you a lot while
debugging), add multiple levels of logging so that you can dig through the logs. The list is endless...
Each node can have an extra field associated with it. This field indicates the number of nodes after this
node in the linked list. This extra field needs to be kept up-to-date when we inserte or delete nodes in
the linked list (It might become slightly complicated when insertion or deletion happens not at end, but
anywhere in the linked list). Then, if for any node, p->field > 0 and p->next == NULL, it surely points
to a pointer corruption.
You could also keep the count of the total number of nodes in a linked list and use it to check if the list
is indeed having those many nodes or not.
The problem in detecting such pointer corruptions in C is that its only the programmer who knows that the
pointer is corrupted. The program has no way of knowing that something is wrong. So the best way to fix these
errors is check your logic and test your code to the maximum possible extent. I am not aware of ways in C to
recover the lost nodes of a corrupted linked list. C does not track pointers so there is no good way to know if an
arbitrary pointer has been corrupted or not. The platform may have a library service that checks if a pointer
points to valid memory (for instance on Win32 there is a IsBadReadPtr, IsBadWritePtr API.) If you detect a
cycle in the link list, it's definitely bad. If it's a doubly linked list you can verify, pNode->Next->Prev == pNode.
I have a hunch that interviewers who ask this question are probably hinting at something called Smart Pointers
in C++. Smart pointers are particularly useful in the face of exceptions as they ensure proper destruction of
dynamically allocated objects. They can also be used to keep track of dynamically allocated objects shared by
multiple owners. This topic is out of scope here, but you can find lots of material on the Internet for Smart
Pointers.
The solution is to iterate down the list looking for the correct place to insert the new node. That could be the end
of the list, or a point just before a node which is larger than the new node.
Note that we assume the memory for the new node has already been allocated and a pointer to that memory is
being passed to this function.
As the linked list is sorted, we can start from the beginning of the list and compare adjacent nodes. When
adjacent nodes are the same, remove the second one. There's a tricky case where the node after the next node
needs to be noted before the deletion.
Use Recursion.
The only way to search a linked list is with a linear search, because the only way a linked lists members can be
accessed is sequentially. Sometimes it is quicker to take the data from a linked list and store it in a different data
structure so that searches can be more efficient.
Write your own....
Here is a C program which explains a different way of coding the atoi() function in the C language.
#include<stdio.h>
// Dont increment i!
}
return(i);
}
Try working it out with a small string like "1998", you will find out it does work!.
This can be done either by going from right to left or left to right in the string
if (string)
{
while (*string && (*string <= '9' && *string >= '0'))
{
value = (value * 10) + (*string - '0');
string++;
}
}
return value;
}
Note that these functions have no error handling incorporated in them (what happens if someone passes non-
numeric data (say "1A998"), or negative numeric strings (say "-1998")). I leave it up to you to add these cases.
The essense is to understand the core logic first.
Implement the memmove() function. What is the difference between the memmove() and memcpy() function?
memmove() offers guaranteed behavior if the source and destination arguments overlap. memcpy() makes no
such guarantee, and may therefore be more efficient to implement. It's always safer to use memmove().
Here is an implementation..
#include <stdio.h>
#include <string.h>
printf("\n--------------------------------\n");
/* ----------------------------------------
*
* CASE 1 : From (SRC) < To (DEST)
*
* +--+---------------------+--+
* | | | |
* +--+---------------------+--+
* ^ ^
* | |
* From To
*
* --------------------------------------- */
p1 = (char *) malloc(12);
memset(p1,12,'\0');
size=10;
strcpy(p1,"ABCDEFGHI");
p2 = p1 + 2;
printf("\n--------------------------------\n");
printf("\nFrom (before) = [%s]",p1);
printf("\nTo (before) = [%s]",p2);
mymemmove(p2,p1,size);
printf("\n--------------------------------\n");
/* ----------------------------------------
*
* CASE 2 : From (SRC) > To (DEST)
*
* +--+---------------------+--+
* | | | |
* +--+---------------------+--+
* ^ ^
* | |
* To From
*
* --------------------------------------- */
p3 = (char *) malloc(12);
memset(p3,12,'\0');
p4 = p3 + 2;
strcpy(p4, "ABCDEFGHI");
printf("\n--------------------------------\n");
/* ----------------------------------------
*
* CASE 3 : No overlap
*
* --------------------------------------- */
p1 = (char *) malloc(30);
memset(p1,30,'\0');
size=10;
strcpy(p1,"ABCDEFGHI");
p2 = p1 + 15;
printf("\n--------------------------------\n");
printf("\nFrom (before) = [%s]",p1);
printf("\nTo (before) = [%s]",p2);
mymemmove(p2,p1,size);
printf("\n--------------------------------\n");
printf("\n\n");
return 0;
}
p2 = p2 + size;
if (p2 != from)
{
// Overlap detected!
p2 = (const unsigned char *) from;
p2 = p2 + size;
p1 = p1 + size;
while (size-- != 0)
{
*--p1 = *--p2;
}
}
else
{
// No overlap OR they overlap as CASE 2 above.
// memcopy() would have done this directly.
while (size-- != 0)
{
*p1++ = *p2++;
}
}
return(to);
}
--------------------------------
--------------------------------
--------------------------------
--------------------------------
So then, whats the difference between the implementation of memmove() and memcpy(). Its just that memcpy()
will not care if the memories overlap and will either copy from left to right or right to left without checking
which method to used depending on the type of the overlap. Also note that the C code proves that the results are
the same irrespective of the Endian-ness of the machine.
There are a number of ways to find a string inside another string. Its important to be aware of these algorithms
than to memorize them. Some of the fastest algorithms are quite tough to understand!.
Method1
The first method is the classic Brute force method. The Brute Force algorithm checks, at all positions in the text
between 0 and (n-m), if an occurrence of the pattern starts at that position or not. Then, after each successfull or
unsuccessful attempt, it shifts the pattern exactly one position to the right. The time complexity of this searching
phase is O(mn). The expected number of text character comparisons is 2n.
Here 'n' is the size of the string in which the substring of size 'm' is being searched for.
#include<stdio.h>
/* Searching */
for (j = 0; j <= (n - m); ++j)
{
for (i = 0; i < m && x[i] == y[i + j]; ++i);
if (i >= m) {printf("\nMatch found at\n\n->[%d]\n->[%s]\n",j,y+j);}
}
}
int main()
{
char *string = "hereroheroero";
char *pattern = "hero";
BF(pattern,strlen(pattern),string,strlen(string));
printf("\n\n");
return(0);
}
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
|||| ----> Match!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
Method2
Instead of checking at each position of the text if the pattern occurs or not, it is better to check first if the
contents of the current string "window" looks like the pattern or not. In order to check the resemblance between
these two patterns, a hashing function is used. Hashing a string involves computing a numerical value from the
value of its characters using a hash function.
The Rabin-Karp method uses the rule that if two strings are equal, their hash values must also be equal. Note
that the converse of this statement is not always true, but a good hash function tries to reduce the number of
such hash collisions. Rabin-Karp computes hash value of the pattern, and then goes through the string
computing hash values of all of its substrings and checking if the pattern's hash value is equal to the substring
hash value, and advancing by 1 character every time. If the two hash values are the same, then the algorithm
verifies if the two string really are equal, rather than this being a fluke of the hashing scheme. It uses regular
string comparison for this final check. Rabin-Karp is an algorithm of choice for multiple pattern search. If we
want to find any of a large number, say k, fixed length patterns in a text, a variant Rabin-Karp that uses a hash
table to check whether the hash of a given string belongs to a set of hash values of patterns we are looking for.
Other algorithms can search for a single pattern in time order O(n), hence they will search for k patterns in time
order O(n*k). The variant Rabin-Karp will still work in time order O(n) in the best and average case because a
hash table allows to check whether or not substring hash equals any of the pattern hashes in time order of O(1).
#include<stdio.h>
hashing_function()
{
// A hashing function to compute the hash values of the strings.
....
}
printf("\nstring : [%s]"
"\nlength : [%d]"
"\npattern : [%s]"
"\nlength : [%d]\n\n", y,n,x,m);
/* Preprocessing phase */
Do preprocessing here..
/* Searching */
j = 0;
while (j <= n-m)
{
if (hx == hy && memcmp(x, y + j, m) == 0)
{
// Hashes match and so do the actual strings!
printf("\nMatch found at : [%d]\n",j);
}
int main()
{
char *string="hereroheroero";
char *pattern="hero";
KarpRabin(pattern,strlen(pattern),string,strlen(string));
printf("\n\n");
return(0);
}
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
|||| ----> Hash values match, so do the strings!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
Method3
The Knuth-Morris-Pratt or the Morris-Pratt algorithms are extensions of the basic Brute Force algorithm. They
use precomputed data to skip forward not by 1 character, but by as many as possible for the search to succeed.
Here is some code
while (i < m)
{
while (j > -1 && x[i] != x[j])
j = Next[j];
Next[++i] = ++j;
}
}
/* Preprocessing */
preComputeData(x, m, Next);
/* Searching */
i = j = 0;
while (j < n)
{
while (i > -1 && x[i] != y[j])
i = Next[i];
i++;
j++;
if (i >= m)
{
printf("\nMatch found at : [%d]\n",j - i);
i = Next[i];
}
}
}
int main()
{
char *string="hereroheroero";
char *pattern="hero";
MorrisPrat(pattern,strlen(pattern),string,strlen(string));
printf("\n\n");
return(0);
}
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
!
hero
hereroheroero
|||| ----> Match found!
hero
hereroheroero
!
hero
Method4
The Boyer Moore algorithm is the fastest string searching algorithm. Most editors use this algorithm.
It compares the pattern with the actual string from right to left. Most other algorithms compare from left to right.
If the character that is compared with the rightmost pattern symbol does not occur in the pattern at all, then the
pattern can be shifted by m positions behind this text symbol.
Example:
0 1 2 3 4 5 6 7 8 9 ...
abbadabacba
||
babac|
<------ |
|
babac
The comparison of "d" with "c" at position 4 does not match. "d" does not occur in the pattern. Therefore, the
pattern cannot match at any of the positions 0,1,2,3,4, since all corresponding windows contain a "d". The
pattern can be shifted to position 5. The best case for the Boyer-Moore algorithm happens if, at each search
attempt the first compared character does not occur in the pattern. Then the algorithm requires only O(n/m)
comparisons .
This method is called bad character heuristics. It can also be applied if the bad character (the character that
causes a mismatch), occurs somewhere else in the pattern. Then the pattern can be shifted so that it is aligned to
this text symbol. The next example illustrates this situation.
Example:
0 1 2 3 4 5 6 7 8 9 ...
abbababacba
|
babac
<----
|
babac
Comparison between "b" and "c" causes a mismatch. The character "b" occurs in the pattern at positions 0 and 2.
The pattern can be shifted so that the rightmost "b" in the pattern is aligned to "b".
Sometimes the bad character heuristics fails. In the following situation the comparison between "a" and "b"
causes a mismatch. An alignment of the rightmost occurence of the pattern symbol a with the text symbol a
would produce a negative shift. Instead, a shift by 1 would be possible. However, in this case it is better to
derive the maximum possible shift distance from the structure of the pattern. This method is called good suffix
heuristics.
Example:
0 1 2 3 4 5 6 7 8 9 ...
abaababacba
|||
cabab
<----
|||
cabab
The suffix "ab" has matched. The pattern can be shifted until the next occurence of ab in the pattern is aligned to
the text symbols ab, i.e. to position 2.
In the following situation the suffix "ab" has matched. There is no other occurence of "ab" in the
pattern.Therefore, the pattern can be shifted behind "ab", i.e. to position 5.
Example:
0 1 2 3 4 5 6 7 8 9 ...
abcababacba
|||
cbaab
cbaab
In the following situation the suffix "bab" has matched. There is no other occurence of "bab" in the pattern. But
in this case the pattern cannot be shifted to position 5 as before, but only to position 3, since a prefix of the
pattern "ab" matches the end of "bab". We refer to this situation as case 2 of the good suffix heuristics.
Example:
0 1 2 3 4 5 6 7 8 9 ...
aabababacba
||||
abbab
abbab
The pattern is shifted by the longest of the two distances that are given by the bad character and the good suffix
heuristics.
The Boyer-Moore algorithm uses two different heuristics for determining the maximum possible shift distance
in case of a mismatch: the "bad character" and the "good suffix" heuristics. Both heuristics can lead to a shift
distance of m. For the bad character heuristics this is the case, if the first comparison causes a mismatch and the
corresponding text symbol does not occur in the pattern at all. For the good suffix heuristics this is the case, if
only the first comparison was a match, but that symbol does not occur elsewhere in the pattern.
A lot of these algorithms have been explained here with good visualizations. Remember, again that its sufficient
to know the basic Brute force algorithm and be aware of the other methods. No one expects you to know every
possible algorithm on earth.
This is again one of the most frequently asked interview questions. Here is a C program which implements a
basic version of printf(). This is a really, really simplified version of printf(). Note carefully how floating point
and other compilcated support has been left out. Also, note how we use low level puts() and putchar(). Dont
make a fool of yourself by using printf() within the implementation of printf()!
#include<stdio.h>
#include<stdarg.h>
main()
{
void myprintf(char *,...);
char * convert(unsigned int, int);
int i=65;
char str[]="This is my string";
myprintf("\nMessage = %s%d%x",str,i,i);
}
char *p;
int i;
unsigned u;
char *s;
va_list argp;
va_start(argp, fmt);
p=fmt;
for(p=fmt; *p!='\0';p++)
{
if(*p=='%')
{
putchar(*p);continue;
}
p++;
switch(*p)
{
case 'c' : i=va_arg(argp,int);putchar(i);break;
case 'd' : i=va_arg(argp,int);
if(i<0){i=-i;putchar('-');}puts(convert(i,10));break;
case 'o': i=va_arg(argp,unsigned int); puts(convert(i,8));break;
case 's': s=va_arg(argp,char *); puts(s); break;
case 'u': u=va_arg(argp,argp, unsigned int); puts(convert(u,10));break;
case 'x': u=va_arg(argp,argp, unsigned int); puts(convert(u,16));break;
case '%': putchar('%');break;
}
}
va_end(argp);
}
ptr=&buf[sizeof(buff)-1];
*ptr='\0';
do
{
*--ptr="0123456789abcdef"[num%base];
num/=base;
}while(num!=0);
return(ptr);
}
Here are some C programs which implement the strcpy() function. This is one of the most frequently asked C
interview questions.
Method1
The strcpy function copies src, including the terminating null character, to the location specified by dst. No
overflow checking is performed when strings are copied or appended. The behavior of strcpy is undefined if the
source and destination strings overlap. It returns the destination string. No return value is reserved to indicate an
error.
Method2
There are many ways one can implement the strcmp() function. Note that strcmp(str1,str2) returns a -ve number
if str1 is alphabetically above str2, 0 if both are equal and +ve if str2 is alphabetically above str1.
Here are some C programs which implement the strcmp() function. This is also one of the most frequently asked
interview questions. The prototype of strcmp() is
#include <stdio.h>
int main()
{
printf("\nstrcmp() = [%d]\n", mystrcmp("A","A"));
printf("\nstrcmp() = [%d]\n", mystrcmp("A","B"));
printf("\nstrcmp() = [%d]\n", mystrcmp("B","A"));
return(0);
}
int main()
{
char str1[] = "India";
char str2[25];
#include <stdio.h>
#include <conio.h>
int main()
{
char subStr[100];
char str[]="My Name Is Sweet";
file_path_from = "<something>";
file_path_to = "<something_else>";
if (!feof(f_from)){exit(1);}
return(0);
}
toUpper()
isUpper()
Its important to know that the upper and lower case alphabets have corresponding integer values.
A-Z - 65-90
a-z - 97-122
Another way to do this conversion is to maintain a correspondance between the upper and lower case alphabets.
The program below does that. This frees us from the fact that these alphabets have a corresponding integer
values. I dont know what one should do for non-english alphabets. Do other languages have upper and lower
case letters in the first place :) !
#include <string.h>
int toUpper(int c)
{
const char *upper;
const char *const lower = LOWER;
// Get the position of the lower case alphabet in the LOWER string using the strchr() function ..
upper = ( ((CHAR_MAX >= c)&&(c > '\0')) ? strchr(lower, c) : NULL);
// Now return the corresponding alphabet at that position in the UPPER string ..
return((upper != NULL)?UPPER[upper - lower] : c);
}
Note that these routines dont have much error handling incorporated in them. Its really easy to add error
handling to these routines or just leave it out (as I like it). This site consciously leaves out error handling for
most of the programs to prevent unwanted clutter and present the core logic first.
while(*p!='\0')
p++;
return(p-s);
}
if (s == NULL || t == NULL)
return s; /* we need not have to do anything */
while (*s)
s++;
return p;
}
Programs
int a,b,t;
t = a;
a = b;
b = t;
There is no way better than this as you will find out soon. There are a few slick expressions that do swap
variables without using temporary storage. But they come with their own set of problems.
a ^= b ^= a ^= b;
Although the code above works fine for most of the cases, it tries to modify variable 'a' two times between
sequence points, so the behavior is undefined. What this means is it wont work in all the cases. This will also
not work for floating-point values. Also, think of a scenario where you have written your code like this
Now, if suppose, by mistake, your code passes the pointer to the same variable to this function. Guess what
happens? Since Xor'ing an element with itself sets the variable to zero, this routine will end up setting the
variable to zero (ideally it should have swapped the variable with itself). This scenario is quite possible in
sorting algorithms which sometimes try to swap a variable with itself (maybe due to some small, but not so fatal
coding error). One solution to this problem is to check if the numbers to be swapped are already equal to each
other.
Method2
a=a+b;
b=a-b;
a=a-b;
But, note that here also, if a and b are big and their addition is bigger than the size of an int, even this might end
up giving you wrong results.
Method3
One can also swap two variables using a macro. However, it would be required to pass the type of the variable
to the macro. Also, there is an interesting problem using macros. Suppose you have a swap macro which looks
something like this
int temp;
temp=temp;
temp=b;
b=temp;
Which means it sets the value of "b" to both the variables!. It never swapped them! Scary, isn't it?
So the moral of the story is, dont try to be smart when writing code to swap variables. Use a temporary variable.
Its not only fool proof, but also easier to understand and maintain.
The 8 queens problem is a classic problem using the chess board. This problem is to place 8 queens on the chess
board so that they do not attack each other horizontally, vertically or diagonally. It turns out that there are 12
essentially distinct solutions to this problem.
Suppose we have an array t[8] which keeps track of which column is occupied in which row of the chess board.
That is, if t[0]==5, then it means that the queen has been placed in the fifth column of the first row. We need to
couple the backtracking algorithm with a procedure that checks whether the tuple is completable or not, i.e. to
check that the next placed queen 'i' is not menaced by any of the already placed 'j' (j < i):
#include<stdio.h>
static int t[10]={-1};
void queens(int i);
int empty(int i);
void print_solution();
int main()
{
queens(1);
print_solution();
return(0);
}
void queens(int i)
{
for(t[i]=1;t[i]<=8;t[i]++)
{
if(empty(i))
{
if(i==8)
{
print_solution();
/* If this exit is commented, it will show ALL possible combinations */
exit(0);
}
else
{
// Recurse!
queens(i+1);
}
}// if
}// for
}
int empty(int i)
{
int j;
j=1;
return((i==j)?1:0);
}
void print_solution()
{
int i;
for(i=1;i<=8;i++)printf("\nt[%d] = [%d]",i,t[i]);
}
t[1] = [1] // This means the first square of the first row.
t[2] = [5] // This means the fifth square of the second row.
t[3] = [8] ..
t[4] = [6] ..
t[5] = [3] ..
t[6] = [7] ..
t[7] = [2] ..
t[8] = [4] // This means the fourth square of the last row.
Here is a C program to print a matrix helically. Printing a matrix helically means printing it in this spiral fashion
>-----------+
|
+---->--+ |
| | |
| | |
| <---+ |
| |
+-----------+
#include<stdio.h>
/* HELICAL MATRIX */
int main()
{
int arr[][4] = { {1,2,3,4},
{5,6,7,8},
{9,10,11,12},
{13, 14, 15, 16}
};
int i, j, k,middle,size;
printf("\n\n");
size = 4;
middle = (size-1)/2;
if (size % 2 == 1) printf("%d", arr[middle][middle]);
printf("\n\n");
return 1;
}
There are a number of ways one can reverse strings. Here are a few of them. These should be enough to impress
the interviewer! The methods span from recursive to non-recursive (iterative).
Also note that there is a similar question about reversing the words in a sentence, but still keeping the words in
place. That is
I am a good boy
would become
boy good a am I
This is dealt with in another question. Here I only concentrate on reversing strings. That is
I am a good boy
would become
yob doog a ma I
Method1 (Recursive)
#include <stdio.h>
if(pos<(strlen(str)/2))
{
char ch;
// Now recurse!
reverse(pos+1);
}
}
Method2
#include <stdio.h>
#include <malloc.h>
#include <string.h>
int main()
{
char buffer[]="This is Test";
ReverseStr(buffer,0,strlen(buffer)-1);
return 0;
}
Method3
Method4
Method5
Method6
public static String reverse(String s)
{
int N = s.length();
char[] a = new char[N];
for (int i = 0; i < N; i++)
a[i] = s.charAt(N-i-1);
String reverse = new String(a);
return reverse;
}
I am a good boy
boy good a am I
Method1
First reverse the whole string and then individually reverse the words
I am a good boy
<------------->
yob doog a ma I
<-> <--> <-> <-> <->
boy good a am I
/*
Algorithm..
#include <stdio.h>
return(0);
}
Method2
Another way to do it is, allocate as much memory as the input for the final output. Start from the right of the
string and copy the words one by one to the output.
Output : boy
: boy good
: boy good a
: boy good a am
: boy good a am I
The only problem to this solution is the extra space required for the output and one has to write this code really
well as we are traversing the string in the reverse direction and there is no null at the start of the string to know
we have reached the start of the string!. One can use the strtok() function to breakup the string into multiple
words and rearrange them in the reverse order later.
Method3
Now its a simple question of reversing the linked list!. There are plenty of algorithms to reverse a linked list
easily. This also keeps track of the number of spaces between the words. Note that the linked list algorithm,
though inefficient, handles multiple spaces between the words really well.
Iterative C program
#include <stdio.h>
#define SIZE 3
int main(char *argv[],int argc)
{
char list[3]={'a','b','c'};
int i,j,k;
for(i=0;i<SIZE;i++)
for(j=0;j<SIZE;j++)
for(k=0;k<SIZE;k++)
if(i!=j && j!=k && i!=k)
printf("%c%c%c\n",list[i],list[j],list[k]);
return(0);
}
Recursive C program
#include <stdio.h>
#define N 5
if(k==m)
{
/* PRINT A FROM k to m! */
for(i=0;i<N;i++){printf("%c",list[i]);}
printf("\n");
}
else
{
for(i=k;i<m;i++)
{
/* swap(a[i],a[m-1]); */
temp=list[i];
list[i]=list[m-1];
list[m-1]=temp;
permute(list,k,m-1);
/* swap(a[m-1],a[i]); */
temp=list[m-1];
list[m-1]=list[i];
list[i]=temp;
}
}
}
fact(int n)
{
int fact;
if(n==1)
return(1);
else
fact = n * fact(n-1);
return(fact);
}
Please note that there is no error handling added to this function (to check if n is negative or 0. Or if n is too
large for the system to handle). This is true for most of the answers in this website. Too much error handling and
standard compliance results in a lot of clutter making it difficult to concentrate on the crux of the solution. You
must ofcourse add as much error handling and comply to the standards of your compiler when you actually
write the code to implement these algorithms.
#include <stdio.h>
int main(int argc, char*argv[])
{
printf("\n[%d]\n",pow(5,4));
}
Also, the code above can be optimized still by calculating pow(z, (n/2)) only one time (instead of twice) and
using its value in the two return() expressions above.
#include<stdio.h>
#define TRUE 1
#define FALSE 0
int main()
{
char *string = "hereheroherr";
char *pattern = "*hero*";
if(wildcard(string, pattern)==TRUE)
{
printf("\nMatch Found!\n");
}
else
{
printf("\nMatch not found!\n");
}
return(0);
}
You are given a large array X of integers (both positive and negative) and you need to find the maximum sum
found in any contiguous subarray of X.
There are various methods to solve this problem, some are listed below
Brute force
maxSum = 0
for L = 1 to N
{
for R = L to N
{
sum = 0
for i = L to R
{
sum = sum + X[i]
}
maxSum = max(maxSum, sum)
}
}
O(N^3)
Quadratic
Note that sum of [L..R] can be calculated from sum of [L..R-1] very easily.
maxSum = 0
for L = 1 to N
{
sum = 0
for R = L to N
{
sum = sum + X[R]
maxSum = max(maxSum, sum)
}
}
Using divide-and-conquer
O(N log(N))
maxSum(L, R)
{
if L > R then
return 0
if L = R then
return max(0, X[L])
M = (L + R)/2
sum = 0; maxToLeft = 0
for i = M downto L do
{
sum = sum + X[i]
maxToLeft = max(maxToLeft, sum)
}
sum = 0; maxToRight = 0
for i = M to R do
{
sum = sum + X[i]
maxToRight = max(maxToRight, sum)
}
#include<stdio.h>
#define N 10
int maxSubSum(int left, int right);
int list[N] = {11, -12, 15, -3, 8, -9, 1, 8, 10, -2};
int main()
{
int i,j,k;
int maxSum, sum;
/*---------------------------------------
* CUBIC - O(n*n*n)
*---------------------------------------*/
maxSum = 0;
for(i=0; i<N; i++)
{
for(j=i; j<N; j++)
{
sum = 0;
for(k=i ; k<j; k++)
{
sum = sum + list[k];
}
maxSum = (maxSum>sum)?maxSum:sum;
}
}
/*-------------------------------------
* Quadratic - O(n*n)
* ------------------------------------ */
maxSum = 0;
for(i=0; i<N; i++)
{
sum=0;
for(j=i; j<N ;j++)
{
sum = sum + list[j];
maxSum = (maxSum>sum)?maxSum:sum;
}
}
/*----------------------------------------
* Divide and Conquer - O(nlog(n))
* -------------------------------------- */
return(0);
}
if(left>right){return 0;}
if(left==right){return((0>list[left])?0:list[left]);}
mid = (left + right)/2;
sum=0;
maxToLeft=0;
for(i=mid; i>=left; i--)
{
sum = sum + list[i];
maxToLeft = (maxToLeft>sum)?maxToLeft:sum;
}
sum=0;
maxToRight=0;
for(i=mid+1; i<=right; i++)
{
sum = sum + list[i];
maxToRight = (maxToRight>sum)?maxToRight:sum;
}
return(((maxCrossing>maxInA)?maxCrossing:maxInA)>maxInB?((maxCrossing>maxInA)?maxCrossing:maxI
nA):maxInB);
}
Note that, if the array has all negative numbers, then this code will return 0. This is wrong because it should
return the maximum sum, which is the least negative integer in the array. This happens because we are setting
maxSum to 0 initially. A small change in this code can be used to handle such cases.
How to generate fibonacci numbers? How to find out if a given number is a fibonacci number or not? Write C
programs to do both.
Here is an iterative way to generate fibonacci numbers and also return the nth number.
int fib(int n)
{
int f[n+1];
f[1] = f[2] = 1;
int fib(int n)
{
if (n <= 2) return 1
else return fib(n-1) + fib(n-2)
}
Here is an iterative way to just compute and return the nth number (without storing the previous numbers).
int fib(int n)
{
int a = 1, b = 1;
for (int i = 3; i <= n; i++)
{
int c = a + b;
a = b;
b = c;
}
return a;
}
There are a few slick ways to generate fibonacci numbers, a few of them are listed below
Method1
n
[11] = [ F(n+1) F(n) ]
[10] [ F(n) F(n-1) ]
or
or
n
(f(0) f(1)) [ 0 1 ] = (f(n) f(n+1))
[11]
The n-th power of the 2 by 2 matrix can be computed efficiently in O(log n) time. This implies an O(log n)
algorithm for computing the n-th Fibonacci number.
Here is the pseudocode for this
int fib(int n)
{
matrixpower(n-1);
return Matrix[0][0];
}
void matrixpower(int n)
{
if (n > 1)
{
matrixpower(n/2);
Matrix = Matrix * Matrix;
}
if (n is odd)
{
Matrix = Matrix * {{1,1}{1,0}}
}
}
#include<stdio.h>
int M[2][2]={{1,0},{0,1}};
int A[2][2]={{1,1},{1,0}};
int C[2][2]={{0,0},{0,0}}; // Temporary matrix used for multiplication.
int main()
{
int n;
n=6;
matMul(n-1);
void matMul(int n)
{
if(n>1)
{
matMul(n/2);
mulM(0); // M * M
}
if(n%2!=0)
{
mulM(1); // M * {{1,1}{1,0}}
}
}
void mulM(int m)
{
int i,j,k;
if(m==0)
{
// C = M * M
for(i=0;i<2;i++)
for(j=0;j<2;j++)
{
C[i][j]=0;
for(k=0;k<2;k++)
C[i][j]+=M[i][k]*M[k][j];
}
}
else
{
// C = M * {{1,1}{1,0}}
for(i=0;i<2;i++)
for(j=0;j<2;j++)
{
C[i][j]=0;
for(k=0;k<2;k++)
C[i][j]+=A[i][k]*M[k][j];
}
}
Method2
The cumbersome way is to generate fibonacci numbers till this number and see if this number is one of them.
But there is another slick way to check if a number is a fibonacci number or not.
To check if a number is a perfect square or not, one can take the square root, round it to the nearest integer and
then square the result. If this is the same as the original whole number then the original was a perfect square.
This is one of the classical problems of computer science. There is a rat trapped in a maze. There are multiple
paths in the maze from the starting point to the ending point. There is some cheese at the exit. The rat starts from
the entrance of the maze and wants to get to the cheese.
111111111111111111111
100000000000000000001
100000010000000000001
100000010000000000001
100000000100001000001
100001000010000000001
100000000100000000001
100000000000000000001
111111111111111111111
The rat can move in four directions at any point in time (well, right, left, up, down). Please note that the
rat can't move diagonally. Imagine a real maze and not a matrix. In matrix language
o Moving right means adding {0,1} to the current coordinates.
o Moving left means adding {0,-1} to the current coordinates.
o Moving up means adding {-1,0} to the current coordinates.
o Moving right means adding {1,0} to the current coordinates.
The rat can start off at the first row and the first column as the entrance point.
From there, it tries to move to a cell which is currently free. A cell is free if it has a zero in it.
It tries all the 4 options one-by-one, till it finds an empty cell. If it finds one, it moves to that cell and
marks it with a 1 (saying it has visited it once). Then it continues to move ahead from that cell to other
cells.
If at a particular cell, it runs out of all the 4 options (that is it cant move either right, left, up or down),
then it needs to backtrack. It backtracks till a point where it can move ahead and be closer to the exit.
If it reaches the exit point, it gets the cheese, ofcourse.
The complexity is O(m*m).
Here is some pseudocode to chew upon
findpath()
{
Position offset[4];
Offset[0].row=0; offset[0].col=1;//right;
Offset[1].row=1; offset[1].col=0;//down;
Offset[2].row=0; offset[2].col=-1;//left;
Offset[3].row=-1; offset[3].col=0;//up;
Position here;
Here.row=1;
Here.col=1;
maze[1][1]=1;
int option = 0;
int lastoption = 3;
while(here.row!=m || here.col!=m)
{
//Find a neighbor to move
int r,c;
while (option<=LastOption)
{
r=here.row + offset[position].row;
c=here.col + offset[option].col;
if(maze[r][c]==0)break;
option++;
}
What Little-Endian and Big-Endian? How can I determine whether a machine's byte order is big-endian or little
endian? How can we convert from one to another?
Little Endian means that the lower order byte of the number is stored in memory at the lowest address, and the
higher order byte is stored at the highest address. That is, the little end comes first.
Base_Address+0 Byte0
Base_Address+1 Byte1
Base_Address+2 Byte2
Base_Address+3 Byte3
"Big Endian" means that the higher order byte of the number is stored in memory at the lowest address, and the
lower order byte at the highest address. The big end comes first.
Base_Address+0 Byte3
Base_Address+1 Byte2
Base_Address+2 Byte1
Base_Address+3 Byte0
In "Little Endian" form, code which picks up a 1, 2, 4, or longer byte number proceed in the same way for all
formats. They first pick up the lowest order byte at offset 0 and proceed from there. Also, because of the 1:1
relationship between address offset and byte number (offset 0 is byte 0), multiple precision mathematic routines
are easy to code. In "Big Endian" form, since the high-order byte comes first, the code can test whether the
number is positive or negative by looking at the byte at offset zero. Its not required to know how long the
number is, nor does the code have to skip over any bytes to find the byte containing the sign information. The
numbers are also stored in the order in which they are printed out, so binary to decimal routines are particularly
efficient.
int num = 1;
if(*(char *)&num == 1)
{
printf("\nLittle-Endian\n");
}
else
{
printf("Big-Endian\n");
}
And here is some code to convert from one Endian to another.
return((byte0 << 24) | (byte1 << 16) | (byte2 << 8) | (byte3 << 0));
}
main()
{
towers_of_hanio(n,'L','R','C');
}
char *myfunction(int n)
{
char buffer[20];
sprintf(buffer, "%d", n);
return retbuf;
}
char *myfunc1()
{
char temp[] = "string";
return temp;
}
char *myfunc2()
{
char temp[] = {'s', 't', 'r', 'i', 'n', 'g', '\0'};
return temp;
}
int main()
{
puts(myfunc1());
puts(myfunc2());
}
The returned pointer should be to a static buffer (like static char buffer[20];), or to a buffer passed in by the
caller function, or to memory obtained using malloc(), but not to a local array.
char *myfunc()
{
char *temp = "string";
return temp;
}
int main()
{
puts(someFun());
}
So will this
calling_function()
{
char *string;
return_string(&string);
printf(?\n[%s]\n?, string);
}
Write a C program which produces its own source code as its output
char*s="char*s=%c%s%c;main(){printf(s,34,s,34);}";main(){printf(s,34,s,34);}
So how does it work?
printf(f,34,f,34,10);
the parameter "f" not only acts as the format string, but also as a value for the %s specifier. The ASCII value of
double quotes is 34, and that of new-line is 10. With these fact ready, the solution is just a matter of tracing the
program.
Write a C progam to convert from decimal to any base (binary, hex, oct etc...)
#include <stdio.h>
int main()
{
decimal_to_anybase(10, 2);
decimal_to_anybase(255, 16);
getch();
}
while(n)
{
m=n%base;
digits[i]="0123456789abcdefghijklmnopqrstuvwxyz"[m];
n=n/base;
i++;
}
Even this is one of the most frequently asked interview questions. I really dont know whats so great in it.
Nevertheless, here is a C program
Method1
if(((~i+1)&i)==i)
{
//Power of 2!
}
#include <stdio.h>
int main()
{
printf("\nGCD(%2d,%2d) = [%d]", 6,4, gcd(6,4));
printf("\nGCD(%2d,%2d) = [%d]", 4,6, gcd(4,6));
printf("\nGCD(%2d,%2d) = [%d]", 3,17, gcd(3,17));
printf("\nGCD(%2d,%2d) = [%d]", 17,3, gcd(17,3));
printf("\nGCD(%2d,%2d) = [%d]", 1,6, gcd(1,6));
printf("\nGCD(%2d,%2d) = [%d]", 10,1, gcd(10,1));
printf("\nGCD(%2d,%2d) = [%d]", 10,6, gcd(10,6));
getch();
getch();
return(0);
}
// Iterative algorithm
int gcd(int a, int b)
{
int temp;
while(b)
{
temp = a % b;
a = b;
b = temp;
}
return(a);
}
// Recursive algorithm
int gcd_recurse(int a, int b)
{
int temp;
temp = a % b;
if (temp == 0)
{
return(b);
}
else
{
return(gcd_recurse(b, temp));
}
}
Iterative
----------------
GCD( 6, 4) = [2]
GCD( 4, 6) = [2]
GCD( 3,17) = [1]
GCD(17, 3) = [1]
GCD( 1, 6) = [1]
GCD(10, 1) = [1]
GCD(10, 6) = [2]
Recursive
----------------
GCD( 6, 4) = [2]
GCD( 4, 6) = [2]
GCD( 3,17) = [1]
GCD(17, 3) = [1]
GCD( 1, 6) = [1]
GCD(10, 1) = [1]
GCD(10, 6) = [2]
Note that you should add error handling to check if someone has passed negative numbers and zero.
#include<stdio.h>
#include<stdlib.h>
#include<conio.h>
int i,j=0,k,a1[10];
main()
{
printf("Enter the array of numbers between 1 and 100(you can repeat the numbers):");
for(i=0;i<=9;i++)
{
scanf("%d",&a1[i]);
}
while(j<10)
{
for(k=0;k<10;k++)
{
if(a1[j]==a1[k] && j!=k)
{
printf("Duplicate found!");
printf("The duplicate is %d\n",a1[j]);
getch();
}
}
j=j+1;
}
getch();
return(0);
}
There are a number of ways to remove duplicates from a sorted array. Here are a few C programs...
Method1
In this simple C program, we change the original array and also send the new size of the array back to the caller.
#include <stdio.h>
int my_array7[] = {1, 2, 8, 8 , 24, 60, 60, 60, 60, 75, 100, 100, 123};
int my_array7_size = 13;
return(0);
}
j = 0;
OLD : [1] [1] [2] [3] [5] [6] [6] [7] [10] [25] [100] [123] [123]
NEW : [1] [2] [3] [5] [6] [7] [10] [25] [100] [123]
OLD : [1] [2] [8] [8] [24] [60] [60] [60] [60] [75] [100] [100] [123]
NEW : [1] [2] [8] [24] [60] [75] [100] [123]
Method2
If we dont want to change the input array and just want to print the array without any duplicates, the solution is
very simple. Check out the removeDuplicatesNoModify() function in the program below. It keeps a track of the
most recently seen number and does not print any duplicates of it when traversing the sorted array.
#include <stdio.h>
int my_array7[] = {1, 2, 8, 8 , 24, 60, 60, 60, 60, 75, 100, 100, 123};
int my_array7_size = 13;
removeDuplicatesNoModify(my_array1, my_array1_size);
removeDuplicatesNoModify(my_array2, my_array2_size);
removeDuplicatesNoModify(my_array3, my_array3_size);
removeDuplicatesNoModify(my_array4, my_array4_size);
removeDuplicatesNoModify(my_array5, my_array5_size);
removeDuplicatesNoModify(my_array6, my_array6_size);
removeDuplicatesNoModify(my_array7, my_array7_size);
return(0);
}
last_seen_unique = array[0];
printf("\n");
}
Old : [ 1] [ 2] [ 3] [ 5] [ 6]
New : [ 1] [ 2] [ 3] [ 5] [ 6]
Old : [ 1] [ 1] [ 1] [ 1] [ 1]
New : [ 1]
Old : [ 1] [ 2] [ 8] [ 8] [24] [60] [60] [60] [60] [75] [100] [100] [123]
New : [ 1] [ 2] [ 8] [24] [60] [75] [100] [123]
Method3
Here is a slightly compilcated, but more visual version of the removeDuplicates() function. It shrinks the
original array as and when it find duplicates. It is also optimized to identify continuous strings of duplicates and
eliminate them at one shot.
#include <stdio.h>
int my_array7[] = {1, 2, 8, 8 , 24, 60, 60, 60, 60, 75, 100, 100, 123};
int my_array7_size = 13;
removeDuplicates(my_array1, &my_array1_size);
removeDuplicates(my_array2, &my_array2_size);
removeDuplicates(my_array3, &my_array3_size);
removeDuplicates(my_array4, &my_array4_size);
removeDuplicates(my_array5, &my_array5_size);
removeDuplicates(my_array6, &my_array6_size);
removeDuplicates(my_array7, &my_array7_size);
return(0);
}
// Changes the original array and resets the size of the array if duplicates
// have been removed.
if(*array_size == 1){return;}
current_pos = i;
dup_start = j;
j++;
dup_end = j-1;
}
}
}
printf("\n\n------------------------------------------------");
printf("\n\nFinal array (size : [%d])\n\n", *array_size);
for(i = 0; i < *array_size; i++)
{
printf("[%2d] ", array[i]);
}
printf("\n\n");
// This function prints the array with some special pointers to the numbers that
// are duplicated.
//
// Dont bother too much about this function, it just helps in understanding
// how and where the duplicates are being removed from.
void print_array(int array[], int array_size, int current_pos, int dup_start, int dup_end)
{
int i;
printf("\n\n");
for(i = 0; i < array_size; i++)
{
printf("[%2d] ", array[i]);
}
printf("\n");
printf("\n");
printf("\n");
C - Current position.
S - Start of duplicates.
E - End of duplicates.
Old : [ 1] [ 2] [ 8] [ 8] [24] [60] [60] [60] [60] [75] [100] [100] [123]
-------------------------------------------------------------------------
[ 1] [ 2] [ 8] [ 8] [24] [60] [60] [60] [60] [75] [100] [100] [123]
^ ^
| |
C S/E
-------------------------------------------------------------------------
New : [ 1] [ 2] [ 8] [24] [60] [75] [100] [123]
If there are other elegant methods of removing duplicate numbers from an array, please let me know!.
max = ((a>b)?((a>c)?a:c):((b>c)?b:c));
Here is another way
max = ((a>b)?a:b)>c?((a>b)?a:b):c;
Here is some code to find the max of 4 numbers...
Method1
#include <stdio.h>
#include <stdlib.h>
Method2
#include <stdio.h>
#include <stdlib.h>
int main()
{
int val = 0 ;
return 0;
}
This is one of the very popular interview questions, so take a good look at it!.
myfunction(int *ptr)
{
int myvar = 100;
ptr = &myvar;
}
main()
{
int *myptr;
myfunction(myptr);
Arguments in C are passed by value. The called function changed the passed copy of the pointer, and not the
actual pointer.
Pass in the address of the pointer to the function (the function needs to accept a pointer-to-a-pointer).
calling_function()
{
char *string;
return_string(/* Pass the address of the pointer */&string);
printf(?\n[%s]\n?, string);
}
Method2
char *myfunc()
{
char *temp = "string";
return temp;
}
int main()
{
puts(myfunc());
}
Write C code to dynamically allocate one, two and three dimensional arrays (using malloc())
Its pretty simple to do this in the C language if you know how to use C pointers. Here are some example C code
snipptes....
Method1
Method3
#define MAXX 3
#define MAXY 4
#define MAXZ 5
main()
{
int ***p,i,j;
p=(int ***) malloc(MAXX * sizeof(int ***));
for(i=0;i<MAXX;i++)
{
p[i]=(int **)malloc(MAXY * sizeof(int *));
for(j=0;j<MAXY;j++)
p[i][j]=(int *)malloc(MAXZ * sizeof(int));
}
for(k=0;k<MAXZ;k++)
for(i=0;i<MAXX;i++)
for(j=0;j<MAXY;j++)
p[i][j][k]=<something>;
How would you find the size of structure without using sizeof()?
struct MyStruct
{
int i;
int j;
};
int main()
{
struct MyStruct *p=0;
int size = ((char*)(p+1))-((char*)p);
printf("\nSIZE : [%d]\nSIZE : [%d]\n", size);
return 0;
}
Are you sure you know this? A lot of people think they already know this, but guess what? So take a good look
at this C program. Its asked in most of the interviews as a warm up question.
// Matrix A (m*n)
// Matrix B (n*k)
// Matrix C (m*k)
There a number of ways in which we can find out if a string is a palidrome or not. Here are a few sample C
programs...
Method1
#include <stdio.h>
#include <string.h>
#include <stdlib.h>
#include <ctype.h>
int main()
{
isPalindrome("avon sees nova");
isPalindrome("a");
isPalindrome("avon sies nova");
isPalindrome("aa");
isPalindrome("abc");
isPalindrome("aba");
isPalindrome("3a2");
exit(0);
}
if(*start!=*end)
{
printf("\n[%s] - This is not a palidrome!\n", string);
}
else
{
printf("\n[%s] - This is a palidrome!\n", string);
}
}
printf("\n\n");
}
Method2
#include <stdio.h>
#include <string.h>
#include <stdlib.h>
#include <ctype.h>
int main()
{
isPalindrome("avon sees nova");
isPalindrome("a");
isPalindrome("avon sies nova");
isPalindrome("aa");
isPalindrome("abc");
isPalindrome("aba");
isPalindrome("3a2");
return(0);
}
N = strlen(string);
end = N-1;
99% of the people who attend interviews can't answer this question, believe me!. Here is a recursive C program
which does this....
#include<stdio.h>
void generatebits(int num);
int main()
{
int num;
printf("\nEnter a number\n");
scanf("%d", &num);
printf("\n\n");
generatebits(num);
getch();
return(0);
}
The reason we have shown a recursive algorithm is that, because of the magic of recursion, we dont have to
reverse the bits generated to produce the final output. One can always write an iterative algorithm to accomplish
the same, but it would require you to first store the bits as they are generated and then reverse them before
producing the final output.
Here is a C function
while(left<=right)
{
middle = ((left + right)/2);
if(item == arr[middle])
{
return(middle);
}
return(-1);
}
Note that the Binary Search algorithm has a prerequisite that the array passed to it must be already sorted in
ascending order. This will not work on an unsorted array. The complexity of this algorithm is O(log(n)).
return(sum);
}
p1 = h1->next;
while(p1!=h1)
{
x1 = p1->px;
y1 = p1->py;
cf1 = p1->cf;
p2 = h2->next;
while(p2 != h2)
{
x2 = p2->px;
y2 = p2->py;
cf2 = p2->cf;
p2 = p2->next;
if(p2 != h2)
{
// We found something in the second polynomial.
cf = cf1 + cf2;
p2->flag = 1;
if(cf!=0){h3=addNode(cf,x1,y1,h3);}
}
else
{
h3=addNode(cf,x1,y1,h3);
}
p1 = p1->next;
}//while
while(p2 != h2)
{
if(p2->flag==0)
{
h3=addNode(p2->cf, p2->px, p2->py, h3);
}
p2=p2->next;
}
return(h3);
}
Write a program to add two long positive numbers (each represented by linked lists).
carry = 0;
c1 = h1->next;
c2 = h2->next;
h3 = insertNode(digit, h3);
c1 = c1->next;
c2 = c2->next;
}
if(c1 != h1)
{
c = c1;
h = h1;
}
else
{
c = c2;
h = h2;
}
while(c != h)
{
sum = c->value + carry;
digit = sum % 10;
carry = sum / 10;
h3 = insertNode(digit, h3);
c = c->next;
}
if(carry==1)
{
h3 = insertNode(carry, h3);
}
return(h3);
}
This is Wrong!.
double a, b;
if(a == b)
{
...
}
The above code might not work always. Thats because of the way floating point numbers are stored.
A good way of comparing two floating point numbers is to have a accuracy threshold which is relative to the
magnitude of the two floating point numbers being compared.
#include <math.h>
if(fabs(a - b) <= accurary_threshold * fabs(a))
There is a lot of material on the net to know how floating point numbers can be compared. Got for it if you
really want to understand.
Another way which might work is something like this. I have not tested it!
b1 = (char *)&f1;
b2 = (char *)&f2;
return(EQUAL);
}
The character '\r' is a carriage return without the usual line feed, this helps to overwrite the current line. The
character '\b' acts as a backspace, and will move the cursor one position to the left.
#include <stdio.h>
int main(void)
{
int old, new=3;
return 0;
}
Try this
#include <stdio.h>
int main(void)
{
char ch=0x34;
printf("\nThe exchanged value is %x",swap_nibbles(ch));
return 0;
}
This question is also quite popular, because it has real practical uses, specially during patching when version
comparison is required
---------------------------------------------------------------------
compare_versions()
This function compare two versions in pl-sql language. This function can compare
Versions like 115.10.1 vs. 115.10.2 (and say 115.10.2 is greater), 115.10.1 vs. 115.10 (and say
115.10.1 is greater), 115.10 vs. 115.10 (and say both are equal)
---------------------------------------------------------------------
release_1_str varchar2(132);
release_2_str varchar2(132);
release_1_ver number;
release_2_ver number;
ret_status boolean := TRUE;
begin
return(ret_status);
end compare_releases;
Usage example
static char stamp[] = "***\nmodule " __FILE__ "\ncompiled " __TIMESTAMP__ "\n***";
...
int main()
{
...
...
}
Try
(num<<3 - num)
This is same as
Now, to split this string say into smaller strings of 20 characters each, try this
#define maxLineSize 20
split(char *string)
{
int i, length;
char dest[maxLineSize + 1];
i = 0;
length = strlen(string);
Yes. Divide and conquer method suggests Strassen's matrix multiplication method to be used. If we follow this
method, the time complexity is O(n^2.81) times rather O(n^3) times.
Now, this guy called Strassen's somehow :) came up with a bunch of equations to calculate the 4 elements of the
resultant matrix
If you are aware, the rudimentary matrix multiplication goes something like this
void matrix_mult()
{
for (i = 1; i <= N; i++)
{
for (j = 1; j <= N; j++)
{
compute Ci,j;
}
}
}
So, essentially, a 2x2 matrix multiplication can be accomplished using 8 multiplications. And the complexity
becomes
2^log 8 =2^3
Strassen showed that 2x2 matrix multiplication can be accomplished in 7 multiplications and 18 additions or
subtractions. So now the complexity becomes
2^log7 =2^2.807
P1 = (A11+ A22)(B11+B22)
P2 = (A21 + A22) * B11
P3 = A11 * (B12 - B22)
P4 = A22 * (B21 - B11)
P5 = (A11 + A12) * B22
P6 = (A21 - A11) * (B11 + B12)
P7 = (A12 - A22) * (B21 + B22)
C11 = P1 + P4 - P5 + P7
C12 = P3 + P5
C21 = P2 + P4
C22 = P1 + P3 - P2 + P6
One common answer is that all compilers keep the size of integer the same as the size of the register on a
perticular architecture. Thus, to know whether the machine is 32 bit or 64 bit, just see the size of integer on it.
Write a program to have the output go two places at once (to the screen and to a file also) .
You can write a wrapper function for printf() which prints twice.
myprintf(...)
{
// printf(); -> To screen.
// write_to_file(); -> To file.
}
# include<stdio.h>
void main()
{
int num=123456;
int sum=0;
Given two strings A and B, how would you find out if the characters in B were a subset of the characters in A?
#include <stdio.h>
#include <conio.h>
int main()
{
char str1[]="defabc";
char str2[]="abcfed";
if(isSubset(str1, str2)==0)
{
printf("\nYes, characters in B=[%s] are a subset of characters in A=[%s]\n",str2,str1);
}
else
{
printf("\nNo, characters in B=[%s] are not a subset of characters in A=[%s]\n",str2,str1);
}
getch();
return(0);
}
return(0);
}
Write a program to merge two arrays in sorted order, so that if an integer is in both the arrays, it gets added into
the final array only once.
Try noting down the address of a local variable. Call another function with a local variable declared in it and
check the address of that local variable and compare!
#include <stdio.h>
#include <stdlib.h>
int main()
{
int local1;
stack(&local1);
exit(0);
}
Actually,
SUM = A XOR B
CARRY = A AND B
On a wicked note, you can add two numbers wihtout using the + operator as follows
a - (- b)
How to generate prime numbers? How to generate the next prime after a given prime?
This is a very vast subject. There are numerous methods to generate primes or to find out if a given number is a
prime number or not. Here are a few of them. I strongly recommend you to search on the Internet for more
elaborate information.
Brute Force
Test each number starting with 2 and continuing up to the number of primes we want to generate. We divide
each numbr by all divisors upto the square root of that number. If no factors are found, its a prime.
Table method
Suppose we want to find all the primes between 1 and 64. We write out a table of these numbers, and proceed as
follows. 2 is the first integer greater than 1, so its obviously prime. We now cross out all multiples of two. The
next number we haven't crossed out is 3. We circle it and cross out all its multiples. The next non-crossed
number is 5; sp we circle it and cross all its mutiples. We only have to do this for all numbers less than the
square root of our upper limit, since any composite in the table must have atleast one factor less than the square
root of the upper limit. Whats left after this process of elimination is all the prime numbers between 1 and 64.
Another "Yeah, I am a jerk, kick me! Kind of a question. I simply dont know why they ask these questions.
end:
return;
}
Write your own trim () or squeeze () function to remove the spaces from a string.
getch();
}
*p = '\0';
return(ps);
}
Another version of this question requires one to reduce multiple spaces, tabs etc to single spaces...
Trees
Running time???
Write a C program to find the depth or height of a tree.
tree_height(mynode *p)
{
if(p==NULL)return(0);
if(p->left){h1=tree_height(p->left);}
if(p=>right){h2=tree_height(p->right);}
return(max(h1,h2)+1);
}
The degree of the leaf is zero. The degree of a tree is the max of its element degrees. A binary tree of height n, h
> 0, has at least h and at most (2^h -1) elements in it. The height of a binary tree that contains n, n>0, elements
is at most n and atleast log(n+1) to the base 2.
n = (2^h - 1)
Here is some sample C code. The idea is to keep on moving till you hit the left most node in the tree
return(current->data);
}
On similar lines, to find the maximum value, keep on moving till you hit the right most node of the tree.
Write a C program to create a mirror copy of a tree (left nodes become right and right nodes become left)!
if(root==NULL)return(NULL);
temp->left = copy(root->right);
temp->right = copy(root->left);
return(temp);
}
This code will will only print the mirror of the tree
if (node==NULL)
{
return;
}
else
{
tree_mirror(node->left);
tree_mirror(node->right);
Write C code to return a pointer to the nth node of an inorder traversal of a BST.
mynode *root;
static ctr;
int main()
{
mynode *temp;
root = NULL;
printf("\n[%d]\n, temp->value);
return(0);
}
if(!found)
{
if(root)
{
nthinorder(root->left, n , nthnode);
if(++whichnode == n)
{
printf("\nFound %dth node\n", n);
found = 1;
*nthnode = root; // Store the pointer to the nth node.
}
nthinorder(root->right, n , nthnode);
}
}
}
inorder(mynode *root)
{
// Plain old inorder traversal
}
temp = malloc(sizeof(mynode));
temp->value = value;
temp->left = NULL;
temp->right = NULL;
if(root == NULL)
{
root = temp;
}
else
{
prev = NULL;
cur = root;
while(cur)
{
prev = cur;
cur = (value < cur->value)? cur->left : cur->right;
}
There seems to be an easier way to do this, or so they say. Suppose each node also has a weight associated with
it. This weight is the number of nodes below it and including itself. So, the root will have the highest weight
(weight of its left subtree + weight of its right subtree + 1). Using this data, we can easily find the nth inorder
node.
Note that for any node, the (weight of the leftsubtree of a node + 1) is its inorder rankin the tree!. Thats simply
because of how the inorder traversal works (left->root->right). So calculate the rank of each node and you can
get to the nth inorder node easily. But frankly speaking, I really dont know how this method is any simpler than
the one I have presented above. I see more work to be done here (calculate thw weights, then calculate the ranks
and then get to the nth node!).
Also, if (n > weight(root)), we can error out saying that this tree does not have the nth node you are looking for.
Write C code to implement the preorder(), inorder() and postorder() traversals. Whats their time complexities?
preorder(mynode *root)
{
if(root)
{
printf("Value : [%d]", root->value);
preorder(root->left);
preorder(root->right);
}
}
Postorder
postorder(mynode *root)
{
if(root)
{
postorder(root->left);
postorder(root->right);
printf("Value : [%d]", root->value);
}
}
Inorder
inorder(mynode *root)
{
if(root)
{
inorder(root->left);
printf("Value : [%d]", root->value);
inorder(root->right);
}
}
if(root==NULL)return(NULL);
temp->left = copy(root->left);
temp->right = copy(root->right);
return(temp);
}
Write C code to check if a given binary tree is a binary search tree or not?
Here is a C program which checks if a given tree is a Binary Search Tree or not...
return(true);
}
1
2 3
5 6 7 8
1235678
Pseduocode
Level_order_traversal(p)
{
while(p)
{
Visit(p);
If(p->left)Q.Add(p->left);
If(p->right)Q.Add(p->right);
Delete(p);
}
}
Here is some C code (working :))..
#include <stdio.h>
typedef struct node
{
int value;
struct node *right;
struct node *left;
}mynode;
mynode *root;
add_node(int value);
add_node(5);
add_node(1);
add_node(-20);
add_node(100);
add_node(23);
add_node(67);
add_node(13);
getch();
}
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
while(root)
{
printf("[%d] ", root->value);
if(root->left)
{
queue[size++] = root->left;
}
if(root->right)
{
queue[size++] = root->right;
}
root = queue[queue_pointer++];
}
}
The node does not exist in the tree - In this case you have nothing to delete.
The node to be deleted has no children - The memory occupied by this node must be freed and either
the left link or the right link of the parent of this node must be set to NULL.
The node to be deleted has exactly one child - We have to adjust the pointer of the parent of the node to
be deleted such that after deletion it points to the child of the node being deleted.
The node to be deleted has two children - We need to find the inorder successor of the node to be
deleted. The data of the inorder successor must be copied into the node to be deleted and a pointer
should be setup to the inorder successor. This inorder successor would have one or zero children. This
node should be deleted using the same procedure as for deleting a one child or a zero child node. Thus
the whole logic of deleting a node with two children is to locate the inorder successor, copy its data and
reduce the problem to a simple deletion of a node with one or zero children.
Here is some C code for these two situations
Situation 1
100 (parent)
50 (cur == psuc)
20 80 (suc)
90
85 95
Situation 2
100 (parent)
50 (cur)
20 90
80
70 (suc)
75
72 76
if(head->left==NULL){printf("\nEmpty tree!\n");return(head);}
parent = head;
cur = head->left;
if(cur == NULL)
{
printf("\nItem to be deleted not found!\n");
return(head);
}
psuc = cur;
cur = cur->left;
while(suc->left!=NULL)
{
psuc = suc;
suc = suc->left;
}
if(cur==psuc)
{
// Situation 1
suc->left = cur->right;
}
else
{
// Situation 2
suc->left = cur->left;
psuc->left = suc->right;
suc->right = cur->right;
}
q = suc;
if(parent->left == cur)
parent->left=q;
else
parent->rlink=q;
freeNode(cur);
return(head);
}
return(root);
}
count_leaf(root->right);
}
}
Write C code for iterative preorder, inorder and postorder tree traversals.
Here is a complete C program which prints a BST using both recursion and iteration. The best way to
understand these algorithms is to get a pen and a paper and trace out the traversals (with the stack or the queue)
alongside. Dont even try to memorize these algorithms!
#include <stdio.h>
mynode *root;
add_node(int value);
void postorder(mynode *root);
void inorder(mynode *root);
void preorder(mynode *root);
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
// Recursive Preorder
void preorder(mynode *root)
{
if(root)
{
printf("[%d] ", root->value);
preorder(root->left);
preorder(root->right);
}
}
// Iterative Preorder
void iterativePreorder(mynode *root)
{
mynode *save[100];
int top = 0;
if (root == NULL)
{
return;
}
save[top++] = root;
while (top != 0)
{
root = save[--top];
if (root->right != NULL)
save[top++] = root->right;
if (root->left != NULL)
save[top++] = root->left;
}
}
// Recursive Postorder
void postorder(mynode *root)
{
if(root)
{
postorder(root->left);
postorder(root->right);
printf("[%d] ", root->value);
}
}
// Iterative Postorder
void iterativePostorder(mynode *root)
{
struct
{
mynode *node;
unsigned vleft :1; // Visited left?
unsigned vright :1; // Visited right?
}save[100];
int top = 0;
save[top++].node = root;
while ( top != 0 )
{
/* Move to the left subtree if present and not visited */
if(root->left != NULL && !save[top].vleft)
{
save[top].vleft = 1;
save[top++].node = root;
root = root->left;
continue;
}
/* Move up */
root = save[--top].node;
}
}
// Recursive Inorder
void inorder(mynode *root)
{
if(root)
{
inorder(root->left);
printf("[%d] ", root->value);
inorder(root->right);
}
}
// Iterative Inorder..
void iterativeInorder (mynode *root)
{
mynode *save[100];
int top = 0;
while(root != NULL)
{
while (root != NULL)
{
if (root->right != NULL)
{
save[top++] = root->right;
}
save[top++] = root;
root = root->left;
}
root = save[--top];
while(top != 0 && root->right == NULL)
{
printf("[%d] ", root->value);
root = save[--top];
}
No
Tree1
a
b
Tree 2
a
b
preorder = ab
postorder = ba
Preorder and postorder do not uniquely define a binary tree. Nor do preorder and level order (same example).
Nor do postorder and level order.
Construct a tree given its inorder and preorder traversal strings. Similarly construct a tree given its inorder and
post order traversal strings.
inorder = g d h b e i a f j c
preorder = a b d g h e i c f j
Scan the preorder left to right using the inorder sequence to separate left and right subtrees. For example, "a" is
the root of the tree; "gdhbei" are in the left subtree; "fjc" are in the right subtree. "b" is the next root; "gdh" are
in the left subtree; "ei" are in the right subtree. "d" is the next root; "g" is in the left subtree; "h" is in the right
subtree.
Scan postorder from right to left using inorder to separate left and right subtrees.
inorder = g d h b e i a f j c
postorder = g h d i e b j f c a
Tree root is "a"; "gdhbei" are in left subtree; "fjc" are in right subtree.
Scan level order from left to right using inorder to separate left and right subtrees.
inorder = g d h b e i a f j c
level order = a b c d e f g h i j
Tree root is "a"; "gdhbei" are in left subtree; "fjc" are in right subtree.
Here is some working code which creates a tree out of the Inorder and Postorder
traversals. Note that here the tree has been represented as an array. This really simplifies the whole
implementation.
A
B C
DE FG
That is, for every node at position j in the array, its left child will be stored at position (2*j) and right child at
(2*j + 1). The root starts at position 1.
#include<stdio.h>
#include<string.h>
#include<ctype.h>
/*-------------------------------------------------------------
* Algorithm
*
* Inorder And Preorder
* inorder = g d h b e i a f j c
* preorder = a b d g h e i c f j
* Scan the preorder left to right using the inorder to separate left
* and right subtrees. a is the root of the tree; gdhbei are in the
* left subtree; fjc are in the right subtree.
*------------------------------------------------------------*/
void copy_str(char dest[], char src[], int pos, int start, int end);
void print_t();
for(i=0;i<strlen(io);i++)
{
if(io[i]==po[0])
{
copy_str(t[1],io,1,i,i); // We have the root here
copy_str(t[2],io,2,0,i-1); // Its left subtree
copy_str(t[3],io,3,i+1,strlen(io)); // Its right subtree
print_t();
}
}
copy_str(t[2*j],t[j],2*j,0,posn-1);
copy_str(t[2*j+1],t[j],2*j+1,posn+1,strlen(t[j]));
copy_str(t[j],t[j],j,posn,posn);
print_t();
}
}
}
}
void copy_str(char dest[], char src[], int pos, int start, int end)
{
char mysrc[100];
strcpy(mysrc,src);
dest[0]='\0';
strncat(dest,mysrc+start,end-start+1);
if(pos>hpos)hpos=pos;
}
void print_t()
{
int i;
for(i=1;i<=hpos;i++)
{
printf("\nt[%d] = [%s]", i, t[i]);
}
printf("\n");
}
#include <stdio.h>
mynode *root;
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
return(temp);
while(root)
{
printf("[%d] ", root->value);
if(root->left)
{
queue[size++] = root->left;
}
if(root->right)
{
queue[size++] = root->right;
}
root = queue[queue_pointer++];
}
}
if(root == NULL)
{
return(NULL);
}
5 (node=0)
13 (node=5) 67 (node=6)
Given an expression tree, evaluate the expression and obtain a paranthesized form of the expression.
infix_exp(p)
{
if(p)
{
printf("(");
infix_exp(p->left);
printf(p->data);
infix_exp(p->right);
printf(")");
}
}
Creating a binary tree for a postfix expression
if(isalnum(symbol))
st[k++] = temp;
else
{
temp->right = st[--k];
temp->left = st[--k];
st[k++] = temp;
}
}
return(st[--k]);
}
Evaluate a tree
switch(root->value)
{
case '+' : return(eval(root->left) + eval(root->right)); break;
case '-' : return(eval(root->left) - eval(root->right)); break;
case '/' : return(eval(root->left) / eval(root->right)); break;
case '*' : return(eval(root->left) * eval(root->right)); break;
case '$' : return(eval(root->left) $ eval(root->right)); break;
default : if(isalpha(root->value))
{
printf("%c = ", root->value);
scanf("%f", &num);
return(num);
}
else
{
return(root->value - '0');
}
}
}
How do you convert a tree into an array?
A
B C
DE FG
That is, for every node at position i in the array, its left child will be stored at position (2*i) and right child at
(2*i + 1). The root starts at position 1.
AVL trees are self-adjusting, height-balanced binary search trees and are named after the inventors: Adelson-
Velskii and Landis. A balanced binary search tree has O(log n) height and hence O(log n) worst case search and
insertion times. However, ordinary binary search trees have a bad worst case. When sorted data is inserted, the
binary search tree is very unbalanced, essentially more of a linear list, with O(n) height and thus O(n) worst case
insertion and lookup times. AVL trees overcome this problem.
An AVL tree is a binary search tree in which every node is height balanced, that is, the difference in the heights
of its two subtrees is at most 1. The balance factor of a node is the height of its right subtree minus the height of
its left subtree (right minus left!). An equivalent definition, then, for an AVL tree is that it is a binary search tree
in which each node has a balance factor of -1, 0, or +1. Note that a balance factor of -1 means that the subtree is
left-heavy, and a balance factor of +1 means that the subtree is right-heavy. Each node is associated with a
Balancing factor.
Balance factor of each node = height of right subtree at that node - height of left subtree at that node.
Please be aware that we are talking about the height of the subtrees and not the weigths of the subtrees. This is a
very important point. We are talking about the height!.
Here is some recursive, working! C code that sets the Balance factor for all nodes starting from the root....
#include <stdio.h>
mynode *root;
if(root==NULL)
{
//printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
return(temp);
if(p == NULL)
{
return(0);
}
else
{
templ = setbf(p->left);
tempr = setbf(p->right);
while(root)
{
printf("\n[%3d] (BF : %3d) ", root->value, root->bf);
if(root->left)
{
queue[size++] = root->left;
}
if(root->right)
{
queue[size++] = root->right;
}
root = queue[queue_pointer++];
}
}
Node = [-20], Left sub-tree height = [0], Right sub-tree height = [0], BF = [0]
Node = [ 1], Left sub-tree height = [1], Right sub-tree height = [0], BF = [-1]
Node = [ 13], Left sub-tree height = [0], Right sub-tree height = [0], BF = [0]
Node = [ 67], Left sub-tree height = [0], Right sub-tree height = [0], BF = [0]
Node = [ 23], Left sub-tree height = [1], Right sub-tree height = [1], BF = [0]
Node = [100], Left sub-tree height = [2], Right sub-tree height = [0], BF = [-2]
Node = [ 5], Left sub-tree height = [2], Right sub-tree height = [3], BF = [1]
[ 5] (BF : 1)
[ 1] (BF : -1)
[100] (BF : -2)
[-20] (BF : 0)
[ 23] (BF : 0)
[ 13] (BF : 0)
[ 67] (BF : 0)
5
1 100
-20 23
13 67
After insertion, the tree might have to be readjusted as needed in order to maintain it as an AVL tree. A node
with balance factor -2 or 2 is considered unbalanced and requires rebalancing the tree. The balance factor is
either stored directly at each node or computed from the heights of the subtrees, possibly stored at nodes. If, due
to an instertion or deletion, the tree becomes unbalanced, a corresponding left rotation or a right rotation is
performed on that tree at a particular node. A balance factor > 1 requires a left rotation (i.e. the right subtree is
heavier than the left subtree) and a balance factor < -1 requires a right rotation (i.e. the left subtree is heavier
than the right subtree).
BEFORE
0 (par)
0 0 (p)
0 0 (tmp)
0 0 0 0
(a) (b)
tmp = p->right;
p->right = tmp->left;
tmp->left = p;
if(par)
{
if(p is the left child of par)
{
par->left=tmp;
}
else
{
par->right=tmp;
}
}
else
{
root=tmp;
}
AFTER
0 (par)
0 0
(tmp)
0 0
(p) (b)
0 0
(a)
0 0
Right rotation
BEFORE
0 (par)
0 0 (p)
0 (tmp) 0
0 0 0 0
(a) (b)
tmp = p->left;
p->left = tmp->right;
tmp->right = p;
if(par)
{
if(p is the left child of par)
{
par->left=tmp;
}
else
{
par->right=tmp;
}
}
else
{
root=tmp;
}
AFTER
0 (par)
0 0 (tmp)
0 0
(a) (p)
0 0
(b)
0 0
Its
2^n - n
So, if there are 10 nodes, you will have (1024 - 10) = 1014 different trees!! Confirm it yourself with a small
number if you dont believe the formula.
A full N-ary tree has M non-leaf nodes, how many leaf nodes does it have?
M + (N ^ (n-1)) = (1 - (N ^ n)) / (1 - N)
Leaf nodes = M * (N - 1) + 1
A
B C D
E F G H I J K L M
Leaf nodes = M * (N - 1) + 1 = 4 * (3 - 1) + 1 = 9
Implement Breadth First Search (BFS) and Depth First Search (DFS).
Depth First Search (DFS) is a generalization of the preorder traversal. Starting at some arbitrarily chosen vertex
v, we mark v so that we know we've visited it, process v, and then recursively traverse all unmarked vertices
adjacent to v (v will be a different vertex with every new method call). When we visit a vertex in which all of its
neighbors have been visited, we return to its calling vertex, and visit one of its unvisited neighbors, repeating the
recursion in the same manner. We continue until we have visited all of the starting vertex's neighbors, which
means that we're done. The recursion (stack) guides us through the graph.
#include <stdio.h>
mynode *root;
// Do a DFS..
printf("\n\nDFS : ");
treeDFS(root);
getch();
}
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
return(temp);
// DFS..
void treeDFS(mynode *root)
{
printf("[%d] ", root->value);
root->visited = 1;
if (root->left)
{
if(root->left->visited==0)
{
treeDFS(root->left);
}
}
if (root->right)
{
if(root->right->visited==0)
{
treeDFS(root->right);
}
}
}
Breadth First Search (BFS) searches the graph one level (one edge away from the starting vertex) at a time. In
this respect, it is very similar to the level order traversal that we discussed for trees. Starting at some arbitrarily
chosen vertex v, we mark v so that we know we've visited it, process v, and then visit and process all of v's
neighbors. Now that we've visited and processed all of v's neighbors, we need to visit and process all of v's
neighbors neighbors. So we go to the first neighbor we visited and visit all of its neighbors, then the second
neighbor we visited, and so on. We continue this process until we've visited all vertices in the graph. We don't
use recursion in a BFS because we don't want to traverse recursively. We want to traverse one level at a time. So
imagine that you visit a vertex v, and then you visit all of v's neighbors w. Now you need to visit each w's
neighbors. How are you going to remember all of your w's so that you can go back and visit their neighbors?
You're already marked and processed all of the w's. How are you going to find each w's neighbors if you don't
remember where the w's are? After all, you're not using recursion, so there's no stack to keep track of them. To
perform a BFS, we use a queue. Every time we visit vertex w's neighbors, we dequeue w and enqueue w's
neighbors. In this way, we can keep track of which neighbors belong to which vertex. This is the same technique
that we saw for the level-order traversal of a tree. The only new trick is that we need to makr the verticies, so we
don't visit them more than once -- and this isn't even new, since this technique was used for the blobs problem
during our discussion of recursion.
public void breadthFirstSearch(vertex v)
{
Queue q = new Queue();
v.visited = true;
q.enQueue(v);
while( !q.isEmpty() )
{
Vertex w = (Vertex)q.deQueue();
Here is some C code which does a BFS (level order traversal) on a BST...
#include <stdio.h>
mynode *root;
add_node(int value);
add_node(5);
add_node(1);
add_node(-20);
add_node(100);
add_node(23);
add_node(67);
add_node(13);
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
while(root)
{
printf("[%d] ", root->value);
if(root->left)
{
queue[size++] = root->left;
}
if(root->right)
{
queue[size++] = root->right;
}
root = queue[queue_pointer++];
}
}
#include <stdio.h>
mynode *root;
add_node(int value);
void postorder(mynode *root);
void inorder(mynode *root);
void preorder(mynode *root);
add_node(5);
add_node(1);
add_node(-20);
add_node(100);
add_node(23);
add_node(67);
add_node(13);
printf("\nPreorder : ");
preorder(root);
printf("\n\nPostorder : ");
postorder(root);
printf("\n\nInorder : ");
inorder(root);
return(0);
}
if(root==NULL)
{
printf("\nCreating the root..\n");
root = temp;
return;
}
prev=NULL;
cur=root;
while(cur!=NULL)
{
prev=cur;
cur=(value<cur->value)?cur->left:cur->right;
}
Since traversing the three is the most frequent operation, a method must be devised to improve the speed. This is
where Threaded tree comes into picture. If the right link of a node in a tree is NULL, it can be replaced by the
address of its inorder successor. An extra field called the rthread is used. If rthread is equal to 1, then it means
that the right link of the node points to the inorder success. If its equal to 0, then the right link represents an
ordinary link connecting the right subtree.
struct node
{
int value;
struct node *left;
struct node *right;
int rthread;
}
temp = x->right;
if(x->rthread==1)return(temp);
while(temp->left!=NULL)temp = temp->left;
return(temp);
}
if(head->left==head)
{
printf("\nTree is empty!\n");
return;
}
temp = head;
for(;;)
{
temp = inorder_successor(temp);
if(temp==head)return;
printf("%d ", temp->value);
}
temp=getnode();
temp->info=item;
r=x->right;
x->right=temp;
x->rthread=0;
temp->left=NULL;
temp->right=r;
temp->rthread=1;
}
Function to find the inorder predecessor (for a left threaded binary three)
temp = x->left;
if(x->lthread==1)return(temp);
while(temp->right!=NULL)
temp=temp->right;
return(temp);
}
Bit Fiddling
This is one of the most frequently asked interview questions of all times...
There are a number of ways to count the number of bits set in an integer. Here are some C programs to do the
same.
Method1
#include<stdio.h>
int main()
{
unsinged int num=10;
int ctr=0;
for(;num!=0;num>>=1)
{
if(num&1)
{
ctr++;
}
}
Method2
This is a faster way of doing the same thing. Here the control goes into the while loop only as many times as the
number of bits set to 1 in the integer!.
#include<stdio.h>
int main()
{
int num=10;
int ctr=0;
while(num)
{
ctr++;
num = num & (num - 1); // This clears the least significant bit set.
}
Method3
This method is very popular because it uses a lookup table. This speeds up the computation. What it does is it
keeps a table which hardcodes the number of bits set in each integer from 0 to 256.
For example
0 - 0 Bit(s) set.
1 - 1 Bit(s) set.
2 - 1 Bit(s) set.
3 - 2 Bit(s) set.
...
What purpose do the bitwise and, or, xor and the shift operators serve?
The AND operator
Truth Table
-----------
0 AND 0 = 0
0 AND 1 = 0
1 AND 0 = 0
1 AND 1 = 1
x AND 0 = 0
x AND 1 = x
We use bitwise "and" to test if certain bit(s) are one or not. And'ing a value against a pattern with ones only in
the bit positions you are interested in will give zero if none of them are on, nonzero if one or more is on. We can
also use bitwise "and" to turn off (set to zero) any desired bit(s). If you "and" a pattern against a variable, bit
positions in the pattern that are ones will leave the target bit unchanged, and bit positions in the pattern that are
zeros will set the target bit to zero.
The OR operator
Truth Table
-----------
0 OR 0 = 0
0 OR 1 = 1
1 OR 0 = 1
1 OR 1 = 1
x OR 0 = x
x OR 1 = 1
0 XOR 0 = 0
0 XOR 1 = 1
1 XOR 0 = 1
1 XOR 1 = 0
x XOR 0 = x
x XOR 1 = ~x
Use bitwise "exclusive or" to flip or reverse the setting of desired bit(s) (make it a one if it was zero or make it
zero if it was one).
Operators >> and << can be used to shift the bits of an operand to the right or left a desired number of positions.
The number of positions to be shifted can be specified as a constant, in a variable or as an expression. Bits
shifted out are lost. For left shifts, bit positions vacated by shifting always filled with zeros. For right shifts, bit
positions vacated by shifting filled with zeros for unsigned data type and with copy of the highest (sign) bit for
signed data type. The right shift operator can be used to achieve quick multiplication by a power of 2. Similarly
the right shift operator can be used to do a quick division by power of 2 (unsigned types only). The operators >>
and <<, dont change the operand at all. However, the operators >>= and <=< also change the operand after
doing the shift operations.
x << y - Gives value x shifted left y bits (Bits positions vacated by shift are filled with zeros).
x <<= y - Shifts variable x left y bits (Bits positions vacated by shift are filled with zeros).
For the right shift, All bits of operand participate in the shift. For unsigned data type,
bits positions vacated by shift are filled with zeros. For signed data type, bits positions
vacated by shift are filled with the original highest bit (sign bit). Right shifting n bits
divides by 2 raise to n. Shifting signed values may fail because for negative values the result never
gets past -1:
A simple C command line utility takes a series of command line options. The options are given to the utility like
this : <utility_name> options=[no]option1,[no]options2,[no]option3?... Write C code using bitwise operators to
use these flags in the code.
//Each option will have a bit reserved in the global_options_bits integer. The global_options_bits
// integer will have a bit set or not set depending on how the option was specified by the user.
// For example, if the user said nooption1, the bit for OPTION1 in global_options_bits
// will be 0. Likewise, if the user specified option3, the bit for OPTION3 in global_options_bits
// will be set to 1.
// Assume you have already parsed the command line option and that
// parsed_argument_without_no has option1 or option2 or option3 (depending on what has
// been provided at the command line) and the variable negate_argument says if the
// option was negated or not (i.e, if it was option1 or nooption1)
if (negate_argument)
{
// Setting the bit for this particular option to 0 as the option has
// been negated.
action_mask= ~(OPTION1_BITPOS) & ALL_BITPOS;
tmp_action= tmp_action & action_mask;
}
else
{
//Setting the bit for this particular option to 1.
action_mask= (OPTION1_BITPOS);
tmp_action= tmp_action | action_mask;
}
//Now someone who wishes to check if a particular option was set or not can use the
// following type of code anywhere else in the code.
if(((global_options_bits & OPTION1_BITPOS) == OPTION1_BITPOS)
{
//Do processing for the case where OPTION1 was active.
}
else
{
//Do processing for the case where OPTION1 was NOT active.
}
Method1
int i;
Method2
0 0 0 0 1 0 0 0
^
|
^ ^ ^
| | |
How would you count the number of bits set in a floating point number?
Sorting Techniques
The property of an ascending heap is that, the root is the lowest and given any other node i, that node should be
less than its left child and its right child. In a descending heap, the root should be the highest and given any
other node i, that node should be greater than its left child and right child.
To sort the elements, one should create the heap first. Once the heap is created, the root has the highest value.
Now we need to sort the elements in ascending order. The root can not be exchanged with the nth element so
that the item in the nth position is sorted. Now, sort the remaining (n-1) elements. This can be achieved by
reconstructing the heap for (n-1) elements.
heapsort()
{
n = array(); // Convert the tree into an array.
makeheap(n); // Construct the initial heap.
keepheap(i)
{
l = 2*i;
r = 2*i + 1;
p = s[l];
q = s[r];
t = s[i];
m = s[largest];
if(largest != i)
{
swap(s[i], s[largest]);
keepheap(largest);
}
}
while(i<=m)
c[k++]=a[i++];
while(j<=n)
c[k++]=a[j++];
}
Implement the bubble sort algorithm. How can it be improved? Write the code for selection sort, quick sort,
insertion sort.
Here is the Bubble sort algorithm
To improvise this basic algorithm, keep track of whether a particular pass results in any swap or not. If not, you
can break out without wasting more cycles.
if(flag==0)break;
}
}
temp = a[pos];
a[pos] = a[i];
a[i] = temp;
}
}
key = a[low];
i = low + 1;
j = high;
while(1)
{
while(i < high && key >= a[i])i++;
int main()
{
// Populate the array a
quicksort(a, 0, n - 1);
}
a[j + 1] = item;
}
}
How can I sort things that are too large to bring into memory?
A sorting program that sorts items that are on secondary storage (disk or tape) rather than primary storage
(memory) is called an external sort. Exactly how to sort large data depends on what is meant by ?too large to fit
in memory.? If the items to be sorted are themselves too large to fit in memory (such as images), but there aren?t
many items, you can keep in memory only the sort key and a value indicating the data?s location on disk. After
the key/value pairs are sorted, the data is rearranged on disk into the correct order. If ?too large to fit in
memory? means that there are too many items to fit into memory at one time, the data can be sorted in groups
that will fit into memory, and then the resulting files can be merged. A sort such as a radix sort can also be used
as an external sort, by making each bucket in the sort a file. Even the quick sort can be an external sort. The data
can be partitioned by writing it to two smaller files. When the partitions are small enough to fit, they are sorted
in memory and concatenated to form the sorted file.