Find the word that appears most often at the beginning of a line in a paragraph

Question

I have a paragraph and I want to know which word appears most often at the beginning of a line, across the whole paragraph.

For example, given this paragraph:

Hello my name is X

Nice to meet you

Hello my name is Y

Here Hello appears 2 times, so I would output hello.

score 4 · Answer 1 · answered Apr 17 '19 at 13:29

awk -v RS= '
  {word = tolower($1); n = ++count[word]}
  n > max {max_word = word; max = n}
  END {print max_word}'
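
A minimal sketch for trying this idea on the question's sample text, assuming each line should count as its own record. With `-v RS=` awk works in paragraph mode, where records are blocks separated by blank lines and `$1` is the first word of each block; leaving `RS` at its default and guarding with `NF` counts the first word of every non-empty line instead. The function name `most_common_first_word` is made up for illustration.

```shell
# Hypothetical helper: prints the most frequent first word (lowercased)
# among the non-empty lines read from stdin.
most_common_first_word() {
  awk '
    NF {                                  # skip blank lines
      word = tolower($1)                  # fold case so Hello and hello match
      if (++count[word] > max) {          # new leader found
        max = count[word]
        max_word = word
      }
    }
    END { print max_word }'
}

most_common_first_word <<'EOF'
Hello my name is X
Nice to meet you
Hello my name is Y
EOF
```

On a tie, this sketch keeps whichever word reached the highest count first.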

All my text is lowercase, sorry! So which code should I use now? – John B Apr 17 '19 at 13:31
This code will work fine if the input is all lower case. But, if you want to ‘optimize’ it, change word = tolower($1) to word = $1. – G-Man Says 'Reinstate Monica' Apr 23 '19 at 04:00

score 1 · Answer 2 · answered Apr 26 '19 at 13:18

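The command below prints the most repeated first word along with its count. The counts have to be sorted numerically in reverse before taking the first line, otherwise head only shows the alphabetically first word.

cut -d ' ' -f1 file.txt | sort | uniq -c | sort -rn | head -n 1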
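
The counts printed by `uniq -c` come out in the alphabetical order produced by `sort`, not by frequency, so a numeric reverse sort is usually needed before taking the first line. Here is a sketch of the stages on the question's sample text, assuming words are separated by single spaces and lines have no leading whitespace; `first_word_counts` is a hypothetical name.

```shell
# Hypothetical helper: lists "count word" pairs for the first word of each
# non-empty line on stdin, most frequent first.
first_word_counts() {
  cut -d ' ' -f1 |   # first space-separated field of every line
    grep -v '^$' |   # drop the empty results that blank lines produce
    sort |           # bring identical words next to each other
    uniq -c |        # prefix each distinct word with its count
    sort -rn         # highest count first
}

# Prints the count 2 followed by Hello
first_word_counts <<'EOF' | head -n 1
Hello my name is X
Nice to meet you
Hello my name is Y
EOF
```

Unlike the awk approach with `tolower`, this pipeline is case-sensitive, so Hello and hello would be counted separately.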

answered Apr 26 '19 at 13:18

Swapnil Dhule

score 0 · Answer 3 · edited Apr 26 '19 at 12:21

Tried with the associative array method below:

awk 'NF{a[$1]++}END{for(x in a){print x" appears "a[x]}}' | sort -k3 -nr | sed -n '1p'

Output:

Hello appears 2

edited Apr 26 '19 at 12:21

αғsнιη

answered Apr 25 '19 at 20:13

Praveen Kumar BS

score -1 · Answer 4 · answered Apr 23 '19 at 02:57

Why not the easy way: awk '{ print $1 }' myfile |uniq -c

answered Apr 23 '19 at 02:57

Kalpesh Bhoj

Maybe you meant awk 'NF{ … }' infile| sort| uniq -c | sort -r|head -n1 | ... '? What you currently answered doesn't do anything except add a 1 in front of each first word in a line. Also, a single awk can do the job. – αғsнιη Apr 23 '19 at 06:03
