Remove duplicate lines based on first three column values

Question

I am getting a file with the content below

The first three values might repeat in other lines. I want to keep one instance and remove the other duplicates.

The output should be like below

Please replace the images of the data with the actual data (as text), so that people are able to test their solutions. Don't post images of text. – Kusalananda, Sep 02 '20 at 11:54
Welcome to the site. How do you define "might be repeating"? Do you want to remove a line if the exact combination of value1, value2 and value3 has already occurred, or if any of value1 or value2 or value3 has already occurred in a previous line? – AdminBee, Sep 09 '20 at 10:18

score 2 · Answer 1 · answered Sep 02 '20 at 12:14

I would try

awk '!a[$1 $2 $3]++ { print ;}' file

where

Archemar

In the general case, it would be safer to use a[$1,$2,$3] (with commas), as that inserts the value of SUBSEP between the values that make up the key instead of just concatenating them. A set of 1, 23, 4 would otherwise be indistinguishable from the set 12, 3, 4. Also, { print; } is not actually needed. – Kusalananda Sep 02 '20 at 12:27
That absolutely solved my issue. Thanks a ton. – sravani Sep 02 '20 at 13:05
@sravani Consider accepting a post that solves your problem. – Quasímodo Sep 02 '20 at 13:10
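
To illustrate the key collision that Kusalananda's comment describes, here is a minimal sketch. The sample rows and the variable names (sample, concat_out, subsep_out) are made up for the demonstration, since the question's real data was only posted as an image; whitespace-separated fields are assumed.

```shell
# Hypothetical sample data: rows 1 and 2 differ in their first three fields,
# but those fields concatenate to the same string "1234".
# Row 3 is a genuine duplicate of row 1 on the first three fields.
sample='1 23 4 first
12 3 4 second
1 23 4 third'

# Plain concatenation: "1" "23" "4" and "12" "3" "4" both yield the key "1234",
# so row 2 is wrongly dropped as a duplicate.
concat_out=$(printf '%s\n' "$sample" | awk '!a[$1 $2 $3]++')

# Comma-separated subscripts: awk joins them with SUBSEP, so the keys stay distinct.
# The pattern alone is enough; the default action is to print the line.
subsep_out=$(printf '%s\n' "$sample" | awk '!a[$1,$2,$3]++')

printf 'concatenated key:\n%s\n\nSUBSEP key:\n%s\n' "$concat_out" "$subsep_out"
```

With this input the concatenated form keeps only the "first" row, while the SUBSEP form keeps "first" and "second" and drops only the true duplicate "third".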
