How to get the number of bytes in just one line of a file?

Question

I am wondering how I can get the number of bytes in just one line of a file.

I know I can use wc -l to get the number of lines in a file, and wc -c to get the total number of bytes in a file. What I want, however, is to get the number of bytes in just one line of a file.

How would I be able to do this?

How are you identifying the line you want to count the bytes in? – pgoetz, Nov 22 '16 at 17:46
Do you really mean bytes or did you intend to ask for characters? echo -n '€' | wc -c -m will return 3 bytes, 1 character. – Chris Davies, Nov 22 '16 at 18:00
The answers given assume you know the line number of the target line ahead of time. Is this correct, or might you need to select a target line based on some other criterion? – Kevin Fegan, Nov 22 '16 at 22:06

score 14 · Accepted Answer · edited Apr 13 '17 at 12:36


sed -n 10p myfile | wc -c

will count the bytes in the tenth line of myfile (including the linefeed/newline character).

A slightly less readable variant,

sed -n "10{p;q;}" myfile | wc -c

(or sed '10!d;q' or sed '10q;d') will stop reading the file after the tenth line, which would be interesting on longer files (or streams). (Thanks to Tim Kennedy and Peter Cordes for the discussion leading to this.)
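As a rough sketch of the difference between the two variants, the commands below build a small throwaway file (the file and the variable names full, early and stream are made up for this illustration) and then run the early-quit form against the endless output of yes. The results are wrapped in $(( )) only to strip the padding some wc implementations add.

```shell
# Hypothetical sample file, created only for this demonstration.
tmp=$(mktemp)
printf 'line %d\n' 1 2 3 4 5 6 7 8 9 10 11 12 > "$tmp"

# Line 10 is "line 10" (7 bytes) plus the newline.
# $(( ... )) strips any leading blanks that wc may print.
full=$(( $(sed -n 10p "$tmp" | wc -c) ))
early=$(( $(sed -n '10{p;q;}' "$tmp" | wc -c) ))

# On an endless stream only the quitting form terminates:
# each line from yes is "y" plus a newline.
stream=$(( $(yes | sed -n '10{p;q;}' | wc -c) ))

echo "full=$full early=$early stream=$stream"
rm -f "$tmp"
```

Both forms report the same count on a regular file; the quitting form simply stops reading sooner, which is what lets it finish on the yes stream.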

There are performance comparisons of different ways of extracting lines of text in the question "cat line X to line Y on a huge file".

answered Nov 22 '16 at 17:57 · Stephen Kitt · edited Apr 13 '17 at 12:36 · Community

nice. I was going to say sed '10q;d' myfile | wc -c, but I like the readability of sed -n 10p. – Tim Kennedy Nov 22 '16 at 17:59

@TimKennedy quitting straight after the requested line as in your variant would be better on large files! I think there's a question here with benchmarks. – Stephen Kitt Nov 22 '16 at 18:10
I seriously need to study SED a lot more . . . . Thanks guys!!!1 – chromechris Nov 22 '16 at 18:56
Good point about quitting right away. sed doesn't try to figure out whether the given sed commands will ever print any more output. yes | sed -n '10{p;q}' | wc -c has a nice combo of readability and performance. It exits right away after seeing line 10. yes | sed -n 10p | wc -c runs forever. – Peter Cordes Nov 23 '16 at 03:42

score 6 · Answer 2 · edited Nov 23 '16 at 13:38


Try this:

line=10
tail -n "+$line" myfile | head -n 1 | wc -c

Set line to the number of the line whose bytes you want to count.
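One possible way to package this pipeline is a small shell function. The name line_bytes, its argument order and the sample file are assumptions made for this sketch, not part of the answer. The second call uses a line containing a NUL byte, a case the tail and head pipeline is expected to pass through unchanged.

```shell
# line_bytes is a hypothetical helper name used only in this sketch.
# $1 = file, $2 = 1-based line number; the count includes the newline.
line_bytes() {
    tail -n "+$2" "$1" | head -n 1 | wc -c
}

# Throwaway sample file for the demonstration.
sample=$(mktemp)
printf 'alpha\nbe\ngamma\n' > "$sample"
second=$(( $(line_bytes "$sample" 2) ))   # "be" plus newline

# A first line made of a, NUL, b and a newline.
printf 'a\000b\nxyz\n' > "$sample"
with_nul=$(( $(line_bytes "$sample" 1) ))

echo "second=$second with_nul=$with_nul"
rm -f "$sample"
```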

answered Nov 22 '16 at 17:48 · jayhendren · edited Nov 23 '16 at 13:38 · Stéphane Chazelas

I am sure there are still more possibilities to add useless forks. – rexkogitans Nov 23 '16 at 12:49

@rexkogitans, you'll find that it's generally faster than using a single sed or awk invocation – Stéphane Chazelas Nov 23 '16 at 13:22

@rexkogitans, it will also work in the presence of NUL bytes in the input while many sed or awk implementations would choke on them. – Stéphane Chazelas Nov 23 '16 at 13:38

score 5 · Answer 3 · answered Nov 23 '16 at 03:05


A little more straightforward using awk:

awk 'NR==10{print length($0)}' myfile

answered Nov 23 '16 at 03:05 · Kevin

Suggestion: exit after printing, instead of reading the rest of the file. yes | awk 'NR==10{print length($0); exit}' exits right away, but yours loops forever (given infinite input). Or with a file, wastes time reading the whole rest of the file from disk. – Peter Cordes Nov 23 '16 at 03:45

@PeterCordes better yet (IMO), skip the rest of the file with nextfile — that way the AWK script can be used to process multiple files! – Stephen Kitt Nov 23 '16 at 13:00
@StephenKitt: Yes, that would be a better design. Good thinking. – Peter Cordes Nov 23 '16 at 13:06
Or just awk 'NR == 10 {print length; exit}'. That's different from the sed|wc solution in that it counts the number of characters as opposed to the number of bytes and doesn't count the line delimiter. – Stéphane Chazelas Nov 23 '16 at 13:19
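To make the bytes versus characters distinction concrete, here is a small sketch on a made-up two-line file whose second line is a single euro sign encoded in UTF-8 (three bytes). The variable names are hypothetical. Forcing LC_ALL=C is one way to get a byte count out of awk. What awk reports in the current locale depends on the locale and on the awk implementation, so that value is printed without assuming a result.

```shell
# Throwaway file: line 1 is "x", line 2 is the euro sign (bytes E2 82 AC).
euro=$(mktemp)
printf 'x\n\342\202\254\n' > "$euro"

# sed | wc counts bytes and includes the trailing newline.
bytes_with_nl=$(( $(sed -n '2{p;q;}' "$euro" | wc -c) ))

# In the C locale every byte is one character, so length is a byte count
# that excludes the newline.
bytes_awk=$(LC_ALL=C awk 'NR == 2 {print length; exit}' "$euro")

# In the current locale the answer depends on the locale and on which awk
# is installed, so no particular value is assumed here.
chars_awk=$(awk 'NR == 2 {print length; exit}' "$euro")

echo "sed|wc=$bytes_with_nl awk(C)=$bytes_awk awk(current locale)=$chars_awk"
rm -f "$euro"
```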
