Linux Help
guides forums blogs
Home Desktops Distributions ISO Images Logos Newbies Reviews Software Support & Resources Linuxhelp Wiki

Welcome Guest ( Log In | Register )

Advanced DNS Management
New ZoneEdit. New Managment.


Sign Up Now
> (G)AWK Script for frequency count and ratio calculation
post Apr 2 2008, 03:27 PM
Post #1

Whats this Lie-nix Thing?

Group: Members
Posts: 2
Joined: 2-April 08
Member No.: 13,413

Hello all,

I am trying to reproduce a graph given in that is Figure 1. Using the titanic data:, I was able to get frequency counts for each combination; however, I am unable to get the ratio for particular cases.

If you look at the titanic data, the data are laid like these:
status, age, sex, survived

I was able to get the counts for all the unique combinations for all the cases; however, I want to get the ratio of 1st class survived to 1st class did not survive. In the given example, that ratio would be 2/1 = 2.

Here's the code that I have written so far to make it generic for any dataset and any variable value (in this case it is "yes" survived):
#!/usr/bin/gawk -f
FS = OFS = ",";
Fields = 4;
Flds2use = 1;
#PredVar = 4;
ClassVal = "yes";

### patterns1: skip blanks and comments
{sub(/\%.*/,"")} ;
/^[ \t]*$/ {next};
/@/ {next};


# /("[^"]*")|('[^\r]*)(\r\n)?/

{ #Records++;
for (i = 1; i <= NF-1; i++)

for (class in Last)
if (class != ClassVal) sum+=Last[class];
#for (f in freq)

#print UnCondProb;
for (word in freq) {
#print word, freq[word]
print ( word ~ ClassVal)
if ( word ~ ClassVal) {
print word, freq[word]
print Num[word]}
else {
for (class in Last)
Denom[word]= freq[word]}

print word, Num[word],Denom[word],Num[word]/Denom[word]

Go to the top of the page
+Quote Post
Start new topic
Replies (1 - 1)
post Apr 3 2008, 08:48 AM
Post #2

Whats this Lie-nix Thing?

Group: Members
Posts: 2
Joined: 2-April 08
Member No.: 13,413

I have posted this at Odesk, if someone wants to make some quick money, he or she can complete this project at Odesk:

Go to the top of the page
+Quote Post

Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:


RSS Lo-Fi Version Time is now: 20th March 2018 - 10:32 PM