how to make grep ignore first line and process other line

Question

I need to remove line beginning with '#' in some txt file. but ignoring the first line as it header. how to make grep ignore first lines and remove any line beginning with # for rest of the lines?

cat sample.txt
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,xyz

cat sample.txt | grep -v "^\s*[#\;]\|^\s*$" > "out.txt"

but this removes the header too!

Possible duplicate of [Omitting the first line from any Linux command output](https://stackoverflow.com/q/7318497/608639), [Print a file skipping first X lines in Bash](https://stackoverflow.com/q/604864/608639), etc. — jww, Apr 21 '19 at 05:30
i dont think its same. I need to write header in the output file too — Aprilian8, Apr 21 '19 at 05:36

Cyrus · Accepted Answer · 2019-04-21T06:29:22.427

6

With sed:

sed '2,${/^#/d}' sample.txt

From second row (2) to last row ($): search (/.../) for rows beginning (^) with # and delete (d) them. Default action of sed is to print current row.

Output:

#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

edited Apr 21 '19 at 06:29

answered Apr 21 '19 at 05:57

Cyrus

77,979
13
71
125

score 1 · Answer 2 · answered Apr 21 '19 at 05:39

Try a combination of head and grep like so:

head -1 sample.txt > out.txt && grep -v "^#" sample.txt >> out.txt

Result

#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

Alternate method

grep "^#" sample.txt | head -1 > out.txt && grep -v "^#" sample.txt >> out.txt

That is - grep lines beginning with # but just choose the first one and write it to a file. Then, grep all lines not starting with # and append those liens to the same output file.

score 1 · Answer 3 · answered Apr 21 '19 at 21:13

1

This will cause any awk to print each line if its line number is 1 or it doesn't start with #:

$ awk 'NR==1 || !/^#/' file
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

answered Apr 21 '19 at 21:13

Ed Morton

172,331
17
70
167

score 1 · Answer 4 · answered Apr 21 '19 at 21:33

1

This might work for you (GNU sed):

sed '1b;/^#/d' file

Ignore the first line and delete any other lines that start with #.

answered Apr 21 '19 at 21:33

potong

51,370
6
49
80

score 1 · Answer 5 · answered Feb 04 '21 at 20:29

Applying an arbitrary command to all but the first line - a "header" - of a file or stream of tabular data is such a common task for me that I define a helper utility called body for it:

As a shell function (put this in your ~/.bashrc or equivalent):

body() {
  IFS= read -r header
  printf '%s\n' "$header"
  "$@"
}

Now:

$ cat sample.txt | body grep -v '^#'
#"EVENT",VERSION, NAME
1,2,xyz
1,2,abc
1,2,asd
1,2,ert
1,2,xyz
1,2,abc
1,2,xyz

Credit: adapted from: Command line tools for doing data science, where it's a one of many handy data tools you can put in your shell's PATH variable. Wish many of these could canonicalized as standard UNIX tools.

Perfect for grepping `lsof` and `ps` results and keeping the header! — John, Feb 04 '22 at 07:01

score 0 · Answer 6 · 2019-04-23T18:27:42.020

0

tried on gnu sed

sed '0,/^#/n;/^#/d' sample.txt

edited Apr 23 '19 at 18:27

answered Apr 21 '19 at 10:55

how to make grep ignore first line and process other line

6 Answers6