I want to extract a specific line from a very large file. For example, line 8000 would be extracted like this:
command -line 8000 > output_line_8000.txt
There's already an answer using perl and awk. Here's a sed answer:
sed -n '8000{p;q}' file
The advantage of the q command is that sed will quit as soon as the 8000th line is read (unlike the other perl and awk methods).
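If the line number lives in a shell variable, the same sed approach works with double quotes (a sketch; LINE is just an illustrative name):
LINE=8000
sed -n "${LINE}{p;q}" file > "output_line_${LINE}.txt"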
A pure Bash possibility (bash≥4):
mapfile -s 7999 -n 1 ary < file
printf '%s' "${ary[0]}"
This will slurp the content of file into the array ary (one line per field), but skip the first 7999 lines (-s 7999) and read only one line (-n 1).
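The same two options extend naturally to a range of lines; for example, this sketch (not part of the original answer) grabs lines 8000 through 8004:
mapfile -s 7999 -n 5 ary < file
printf '%s' "${ary[@]}"
Since mapfile keeps each line's trailing newline by default, printf '%s' reproduces the lines exactly as they appeared in the file.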
The mapfile variant didn't make its way here, surprisingly :)
– devnull
May 17 '14 at 09:09
It's Saturday and I had nothing better to do so I tested some of these for speed. It turns out that the sed, gawk and perl approaches are basically equivalent. The head & tail one is the slowest but, surprisingly, the fastest by an order of magnitude is the pure bash one:
Here are my tests:
$ for i in {1..5000000}; do echo "This is line $i" >>file; done
The above creates a file with 5 million lines which occupies about 100 MB.
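As a side note, if you want to regenerate the test file faster, seq can produce it in a single pass instead of appending one echo at a time (an equivalent alternative, not part of the original test):
seq -f 'This is line %.0f' 5000000 > file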
$ for cmd in "sed -n '8000{p;q}' file" \
"perl -ne 'print && exit if $. == 8000' file" \
"awk 'FNR==8000 {print;exit}' file"
"head -n 8000 file | tail -n 1" \
"mapfile -s 7999 -n 1 ary < file; printf '%s' \"${ary[0]}\"" \
"tail -n 8001 file | head -n 1"; do
echo "$cmd"; for i in {1..100}; do
(time eval "$cmd") 2>&1 | grep -oP 'real.*?m\K[\d\.]+'; done |
awk '{k+=$1}END{print k/100}';
done
sed -n '8000{p;q}' file
0.04502
perl -ne 'print && exit if $. == 8000' file
0.04698
awk 'FNR==8000 {print;exit}' file
0.04647
head -n 8000 file | tail -n 1
0.06842
mapfile -s 7999 -n 1 ary < file; printf '%s' "This is line 8000
"
0.00137
tail -n 8001 file | head -n 1
0.0033
tail | head was the best method in your benchmark the last time this came up (bash's mapfile didn't come up that time).
– Gilles 'SO- stop being evil'
May 18 '14 at 23:12
mapfile is still the fastest, but just barely.
– terdon
May 19 '14 at 00:04
You can do it in many ways.
Using perl:
perl -nle 'print && exit if $. == 8000' file
Using awk:
awk 'FNR==8000 {print;exit}' file
Or you can use tail and head so that reading stops at the 8000th line instead of going through the entire file:
tail -n +8000 file | head -n 1
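The same pair also extracts a range; this sketch (not in the original answer) prints lines 8000 through 8009 and stops reading right after:
tail -n +8000 file | head -n 10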
You could use sed:
sed -n '8000p' filename
If the file is large, then it'd be better to quit:
sed -n '8000p;8001q' filename
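The quit trick extends to ranges as well; this sketch prints lines 8000 through 8010 and then stops reading the file:
sed -n '8000,8010p;8011q' filename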
You could similarly quit reading the entire file using awk or perl too:
awk 'NR==8000{print;exit}' filename
perl -ne 'if ($. == 8000) { print; last }' filename
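For repeated use, any of these one-liners can be wrapped in a small shell function; a minimal sketch (the name nth_line is hypothetical):
nth_line() {
    # $1 = line number, $2 = file; sed quits as soon as the line is printed
    sed -n "$1{p;q}" "$2"
}
nth_line 8000 filename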