Best buys this season: The ELCOT Student Laptop

The Electronics Corporation of Tamil Nadu has started this year’s sale of inexpensive laptops with preloaded Linux (SUSE Linux Enterprise) for students studying in Tamil Nadu. There have been many press releases, for example, see this one. The laptop is said to be procured from Dell or Acer.

The high capacity version of the ELCOT laptop is priced at Rs. 31,717 and is really cheap! To be really convinced of this, I checked out the Dell site for their price on an equivalent configuration. This is really easy, because they have a nice customise feature on their site. So I went here and chose options to make the configuration equivalent to ELCOT’s config here. The results are astounding! The Dell price turns out to be Rs. 55317! The big price addons were 7000 rupees for the 320 GB upgrade, from 160 GB and the 3 years international support which added another (approx.) 8000 rupees. Of course, since Dell has not rolled out its preloaded Linux version in India (Dell sells with Ubuntu preloaded, in the US), the price above includes the price of Microsoft’s operating system.

Students are going to make a killing in this sale! Hurry! Last date for registrations in this first phase is July 15.

Edit: Elcot has extended the last date, as expected. The new last date is 31 July.

Edit 2: Elcot has further extended the last date! The new last date is 15 August.

Breaking the Monotony

Download day Announced!

Download Day 2008

17th June 2008

That’s the official release date that has been announced for Firefox 3 on the Mozilla Developers blog! So lets get ready to implement our pledges and download the best browser out there!

EDIT: 15th June’s “The Hindu” newspapers features an article about Ubuntu and Firefox, how they rock and why they are great for Linux newbies!

Interesting CS Interview Questions #2

This post is a continuation of my interesting CS interview questions series. The last two problems discussed in this post lean more towards logic and probability rather than a CS concept.

Consider the Lock and Unlock operation defined on an m-ary tree as follows:

Lock Node X => If (no node present in the subtree rooted at X (including X) is locked) AND (no direct ancestor of X is locked), then lock X

Unlock Node X => Assumes that it is called only a node that has been previously locked. Simply unlocks the node X.

Design an efficient data structure and algorithms to perform the required lock and unlock operations.

The first solutions that struck me was to simply perform a complete tree traversal and determine if the required conditions for locking are being met or not. This has complexity O(N) where N is the number of nodes. Is this the best that can be done?

On further thought, since we can decide the data structure which suits us best, we can improve the efficiency. Suppose for each node we have a pointer to the parent node. Then, determining if any direct ancestor is locked is simply a loop that climbs the tree, via the parent pointers. How to check if any of the children in the subtree is already locked? Another traversal is possible, but it would not improve our time complexity. If a node is locked, all the direct ancestors need to know that a child is locked. So, this gives the idea to use a flag for each node which is set if a child is locked. Now the lock operation can check if a child is locked in constant time. After locking successfully, it needs to traverse back up to the root via the parent pointers and mark for each node that a child is locked.

How will unlock happen then? We just unlock the node that is given, but we cannot simply unmark the child-node-locked flags in the parents. Why? Because there may be other children that are locked! This seems to complicate things. But not really. Instead of using a flag to hold the information, we use an integer. The integer simply counts the number of child nodes locked. Thus, while unlocking we simply decrement these counts.

Now, what is the time complexity? It is logarithmic because we only look at the path from the node to the root (via the parent pointers).

This one is like a puzzle. A table has 100 coins, 30 of them with Heads facing up. There is another table with no coins. With your eyes blindfolded, what will do so that the number of coins with Heads facing up is equal on each table? You cannot determine if a coin is facing Heads up or not by touching it. You are basically allowed to flip coins and move them from one table to the other.

This problem was pretty challenging for me. Initially the number of Heads in each table is 30 and 0. After experimenting various things with a smaller number of coins, one notices (or at least is supposed to notice) that moving one coin from the first table to the second and flipping it causes the difference in the number of Heads between the tables to decrease by 1. Why? Suppose the coin was a Head in the first table. On flipping and moving it to the other table, the number of Heads in the first table decreases by 1 but the number of Heads on the second table remains the same (as it is added to the Tails count). If the coin was a Tail, then it increases the Heads count in the second table without affecting that of the first table. So the solution is to repeat this process of choosing some coin and flipping it, and moving it to the other table, 30 times. Since the initial difference in the number of Heads is 30, after this process the difference becomes 0. Thus, the number of Heads is equal in each table.

Given a long stream of numbers, how will you choose a single number, such that it is uniformly random among the numbers so far seen?

Although, one can store all the numbers and then choose one at random using standard random functions (like in C), we would like a method that is space efficient. There does exist a method that requires only constant space. Any idea?

Say we have reached the N-th number in the sequence and suppose we already have the solution for the first (N-1) numbers that appeared in the sequence. Suppose this solution is X. Then X was chosen with probability 1/(N-1) from the first (N-1) numbers. Since the N-th number in the sequence should have a 1/N probability of being the random number chosen among the first N numbers in the sequence, we decide if it is indeed the chosen random number with a probability of 1/N. Suppose it were not, then the number X will be the random number chosen. We can verify its probability by noting that it is given by (1/(N-1))*((N-1)/N) = 1/N. Thus, we have the solution: for the i-th number in the sequence, choose it as the random number with probability 1/i, otherwise, do not modify the previously chosen random number.

Interesting CS Interview Questions #1

I’ve been through a lot of CS interviews recently and so I have come across many interesting problems. I find many of them worth sharing. So I am writing this short series of posts on this topic. All of these interviews were for a job post of Software Development Engineer. If you happen to be reading this, as a sort of preparation for an interview, I suggest you attempt to solve the problem yourself and then see how you like my solution (please correct me for any mistakes!). Most interviewers also ask for code. I am not posting that here, since after getting the solution, coding it should be fairly straightforward with a little practice.

Given a binary tree, how will you convert it into a doubly-linked circular list in-place? That is, the left-child and right-child pointers of a node in the tree should be considered as left-node and right-node pointers in the doubly-linked list.

The key here is that only a constant amount of extra space is allowed as the conversion should be performed in-place. That is, we cannot save the data in another, more convenient data structure and then perform the conversion to a linked list.

An important tip is to think of recursive algorithms when faced with problems about trees and especially binary trees.

Using this tip, we think of solving this problem recursively. When we are given the binary tree, what we are actually given is the head pointer of the tree. Thus, we have the head node, say head. We also immediately have the head nodes of the left and right subtrees of this node (head->left and head->right, using C-like notation). Now, we can simply solve the problem recursively. Thus, we can get the following pseudo-code:

getDLL(head):
[Takes the head pointer of a tree, and returns a circular DLL as required above containing the nodes in the given tree.]

  1. save = head
  2. leftList = getDLL(head->left)
  3. rightList = getDLL(head->right)
  4. Now, combine the above, in-order: (leftList, save, rightList). Don’t forget to make it circular by joining the first and the last nodes.

Note that I have not handled the base cases. I am just illustrating the idea. Clearly, the base cases to consider are when either of the sub-trees are empty. If, so just return whatever list is remaining, without adding any null nodes.

Ok, now that you’ve understood the solution, don’t think it’s over. Analyse the complexity. Time complexity is linear in the number of nodes in the tree, i.e. O(N), where N is the number of nodes in the tree. Why? In each, recusive call, the amount of work done other than the recursive calls is constant. [If you are not convinced, complete the code and see for yourself.] Also the number of recursive calls equals the number of nodes. Thus, the result. What is the space complexity? It is not constant because recursion uses the stack. How much space is that?! The maximum size of the stack is proportional to the depth of the tree (why??). I don’t yet know a solution that eliminates this logarithmic space requirement.

It is known that when function calls are made, activation records are created and pushed on top of a stack, which is managed by the operating system. In some systems the stack grows upwards and on some other systems the stack grows downwards. How will you find out if a stack grows upwards or downwards for a program on a particular computer?

This problem is actually pretty easy if you think about it. You are being asked to find if the address of elements of the activation record increase or decrease with nested function calls. The solution is to write a simple C program that saves the address of a local variable (as it is stored on the stack) in a global variable and then this information can be used in the main function, to determine if the stack grows up or down:

#include <stdio.h>
int a,b;
void f(){
       int b; a = &b;
}
void main(){
       int c; b = &c;
       f();
       if(a>b) { /*Stack grows down*/ }
       else { /*Stack grows up*/ }
}

Consider the following code snippets:

for(int i=0;i<100;i++) for(int j=0;j<1000;j++) ;

and,

for(int j=0;j<1000;j++) for(int i=0;i<100;i++) ;

Disregarding all compiler and processor level optimizations, which of these two codes runs faster? Why?

It seems like a nasty puzzle, but the answer is actually pretty simple. We note that all constituent assembly level instructions are of the types: zero assignment, increment, comparison and jump. Let us count them. The first snippet has 101 zero assignment statements, 100,000 increment statements, 1001*100+101 = 100,201 comparison and jump statements. The second snippet has 1001 zero assignment statements, 100,000 increment statements, 1000*101+1001 = 102,001 comparison and jump statements. Clearly, the first one is faster.

Lets spread Firefox!

Download Day 2008

Download Day 2008 will soon be unveiled and hundreds of thousands of people will prepare to download Firefox 3! The higher the number of people who download, the higher the chances of setting a Guinness World Record. Read more about it by clicking the image above.

Though, Firefox is being pushed as a great browser (and it no doubt is), let us not forget that it is also like any software subject to bugs. There does seem to be a sort of careless piece of confidence running around (”If I use Firefox, I am pretty much secure from viruses”, “Firefox is very secure”, etc), and this is very wrong. Let’s be alert and know that Firefox like any other piece of software is not infallible and is actually far from it. So don’t spread Firefox for the wrong reasons! Don’t let people think that they are invulnerable because they use Firefox! People should learn that it is always part of their responsibility to remain alert and not get complacent with security.

New SPOJ Contest has just started

Its been a somewhat long time since anyone has hosted a contest on SPOJ. Now at last Roman Sol is putting up a programming contest at SPOJ. It is between May 3 and May 20 and the problems are tough!

The contest itself is located here.

Enjoy the contest!

The Hardy Heron flys out!

Well, this is just a note. I’m sure everyone already knows! See the features!

YABBA DABBA DOO!

Ubuntu Hardy Heron is almost out!

The release candidate is available! Keeping with its trend to stay on the edge, the release candidate comes with Firefox 3 beta 5 (the last beta version had Beta 4)! Four days to go for the final release, and we at NITT are already getting ready to download the repositories for local usage (we don’t have an internet connection though we have a lan in many places). I am sure there must be lots of Indian colleges out there who would be doing similar things because of the lack of a high bandwidth internet connection.

I am posting the script to automatically start the download of Ubuntu Hardy Heron when the release is available on 24 April 0530 hours IST:

function download { echo "Starting the download of Gutsy Gibbon Repos" debmirror ~/ubuntu/hardy-i386 --host=in.archive.ubuntu.com --root=ubuntu/ --nosource -m --passive --method=http --progress --ignore-release-gpg --section=main,universe,multiverse,restricted --arch=i386 --dist=hardy -proxy=http://10.0.0.111:3128 & } while [ 1 -eq 1 ] do if [ 1208995200 -le `date +%s | bc` ] #1208995200 is the time stamp for the release date in my time zone (IST) then download break else echo sleeping sleep 1000 fi done

This is a bash shell script. Run it using the command “bash ./downloadScript.sh”, where “downloadScript.sh” is the name of the file containing the above code.
You can put this line in your /etc/rc.local to make it run every time the computer starts. The download will always commence from where it was left off the last time!

Here is a neat preview video of the beta release:

An interesting easy problem

I haven’t been writing for a pretty long time now :) Anyways.

Someone asked me to solve this problem:

Given an array A of integers of length N, find a sub-array whose sum is a multiple of N.

To solve it in a brute force manner could be a reasonable exercise for beginners in programming. The solution is to just consider all possible sub-arrays and check if their sum is a multiple of N. This could be O(N3). If you use an cumulative sums array (ie, S[i] =∑A[k] where k goes from 1 to i) the solution is slightly smarter and can be solved in O(N2): just choose all possible (a,b) such that 1<=a<b<=N and see if S[b]-S[a] is a multiple of N (make sure you handle corner cases).

Now we come to the more interesting linear solution. Use the same cumulative sums array as above but make it so that S[i] = ((∑A[k]) mod N) or in C parlance: S[i] = (∑A[k])%N. Clearly, if any of these values is 0, we are done: the array A[1], A[2], …, A[i] constitutes a sub-array with the required property if S[i] = 0. Also, the notice that if any of the remainders occur twice, i.e. S[i] = S[j] for i < j, then A[i+1], A[i+2], …, A[j] constitutes a subarray with the required property. With these observations the linear solution should be obvious: just use another array of size N to keep track of which remainder was seen where (i.e. let R[i] be the index in the array S, where the remainder was i) and then output a solution when a particular remainder is seen twice, or when the remainder 0 is seen.

Some of you may also have stumbled onto another realization:

A solution to this problem always exists!

Its not such a big deal anyway: If none of the remainders occur twice in the cumulative sums array, clearly one of the remainders has to be 0 (since there are N cumulative sums and N remainders)! Thus, the result!