Favorite Color

November 11, 2014

Little database problems like this can be solved by big Scheme programs or little Awk programs; despite my fondness for Scheme, I solve problems like this using Awk:

awk ' /^favoritecolor: / { color[$2]++ } END { for (c in color) print color[c], c } ' database | sort -rn | sed '1q' | awk ' { print $2 } '

The first line finds all favoritecolor database fields and counts the occurrences of each color. The second line prints each color/count combination on a separate line. The third line sorts in reverse numeric order. The fourth line selects the first (maximal) color. The fifth line strips the count and prints the color. Easy to do, and written as fast as I can type.

I wrote this little exercise because I had a problem at work today that could be solved in a manner similar to this. It’s a reminder that we don’t always need big programs; sometimes, a little program will do the job just as well.

You can see the program at http://programmingpraxis.codepad.org/cFVZnPVP. If you have some other little program, you might want to share it with the rest of us.

Posted by programmingpraxis

Filed in Exercises

3 Comments »

3 Responses to “Favorite Color”

James Curtis-Smith said

November 11, 2014 at 9:18 AM

Or you can do it in perl…

perl -e '$x{$_}++ foreach map {m{favoritecolor: (.*)} ? $1 : ()} <>; print [sort {$x{$a} <=> $x{$b}} keys %x ]->[-1],"\n";' file.txt

Jussi Piitulainen said
November 11, 2014 at 10:22 AM
Nah. I thought of Awk, and I thought of Python’s Counter objects, but this is what I would actually do, including the output of more than one line, to see if there is a tie or a near tie.
$ grep -E '^favoritecolor:' particular.txt | cut -d ' ' -f 2 | sort | uniq -c | sort -nr | head
Mike said
November 13, 2014 at 1:03 AM
Unfortunately, my UNIX command line is so rusty, I’d use Python.

Assumes one name/value pair per line in the file (problem doesn’t specify). Didn’t bother splitting the name/value pair, because it doesn’t change the counts. Outputs a sorted list of all the colors and their counts.
```
from collections import Counter
import re

match = re.compile(r'favoritecolor: .+').match
with open('./testdb.txt', 'rt') as f:
    Counter(filter(match, f)).most_common()
```

S	M	T	W	T	F	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

Programming Praxis

Favorite Color

November 11, 2014

3 Responses to “Favorite Color”

Leave a comment

Categories

Archives

Archives

Programming Praxis

Favorite Color

November 11, 2014

Share this:

Related

3 Responses to “Favorite Color”

Leave a comment

Categories

Archives

Archives