awk is a programming language used for text processing and extracting data from files. It can print selected portions of lines from one or more files based on pattern matching or conditional statements. Some key features include:
- Using selection criteria and actions to process lines matching patterns
- Accessing individual fields of a line using $1, $2 etc and whole line using $0
- Comparison operators like ==, !=, > and regular expression matching using ~ and !~
- Built-in variables like NR (record number), NF (number of fields) for processing files