TSV Sample Files

TSV File Definition (Wikipedia):

A tab-separated values (TSV) file is a simple text format for storing data in a tabular structure, e.g., database table or spreadsheet data, and a way of exchanging information between databases. Each record in the table is one line of the text file. Each field value of a record is separated from the next by a tab character. The TSV format is thus a type of the more general delimiter-separated values format.

TSV is a simple file format that is widely supported, so it is often used in data exchange to move tabular data between different computer programs that support the format. For example, a TSV file might be used to transfer information from a database program to a spreadsheet.

TSV is an alternative to the common comma-separated values (CSV) format, which often causes difficulties because of the need to escape commas – literal commas are very common in text data, but literal tab stops are infrequent in running text. The IANA standard for TSV achieves simplicity by simply disallowing tabs within fields.

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Sample Files:

These files were converted from open source comma delimited CSV files into the tab delimited TSV/text file.

  • mlb_players.tsv, Major League Baseball Players: Name, Team, Position, height (inches), weight (pounds), age (years). 1034 records, each with six values. There is also an initial header line. (csv source, GNU LGPL)

  • mlb_teams_2012.tsv, Major League Baseball 2012 Season: Team, Payroll(millions), Wins. 30 records, each with 3 values. There is also an initial header line. (csv source, GNU LGPL)

  • deniro.tsv, Rotten Tomato ratings of movies with Robert De Niro. There are 87 records. Each record has Year, Rating, Title. There is also an initial header line. (csv source, GNU LGPL)

  • hw_200.tsv, height and weight for 200 individuals Each record includes 3 values: index, height (inches), weight (pounds). There is also an initial header line. (csv source, GNU LGPL)

Last updated