Data organization in spreadsheets

by Karl W. Broman, Kara H. Woo

Released as a post by PeerJ.

2017  

Abstract

Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this paper offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses. The basic principles are: be consistent, write dates like YYYY-MM-DD, don't leave any cells empty, put just one thing in a cell, organize the data as a single rectangle (with subjects as rows and variables as columns, and with a single header row), create a data dictionary, don't include calculations in the raw data files, don't use font color or highlighting as data, choose good names for things, make backups, use data validation to avoid data entry errors, and save the data in plain text files.
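Several of these principles can be checked mechanically once the data is saved as plain text. The following is a minimal sketch, not the authors' method: it assumes a comma-separated file with a single header row, and the function name `check_rectangle`, the column names, and the sample values are all hypothetical and made up for illustration. It flags rows that break the rectangle, empty cells, and dates that are not written as YYYY-MM-DD.

```python
import csv
import io
import re

# Pattern for the YYYY-MM-DD layout only; it does not validate calendar dates.
ISO_DATE = re.compile(r"\d{4}-\d{2}-\d{2}")


def check_rectangle(text, date_columns=()):
    """Return a list of (row_number, column, problem) tuples for CSV text.

    Hypothetical helper: assumes the first row is the single header row.
    """
    problems = []
    rows = list(csv.reader(io.StringIO(text)))
    header, body = rows[0], rows[1:]
    for row_number, row in enumerate(body, start=2):  # row 1 is the header
        if len(row) != len(header):
            # The row does not fit the rectangle defined by the header.
            problems.append((row_number, None, "wrong number of cells"))
            continue
        for name, cell in zip(header, row):
            if cell.strip() == "":
                problems.append((row_number, name, "empty cell"))
            elif name in date_columns and not ISO_DATE.fullmatch(cell):
                problems.append((row_number, name, "date not YYYY-MM-DD"))
    return problems


# Made-up sample data: one subject per row, one variable per column.
sample = (
    "subject_id,visit_date,glucose\n"
    "101,2017-08-24,134.2\n"
    "102,8/25/2017,\n"
    "103,2017-08-26,98.1\n"
)

problems = check_rectangle(sample, date_columns={"visit_date"})
for problem in problems:
    print(problem)
```

Here the second data row is reported twice, once for the date format and once for the empty cell. A check like this complements, rather than replaces, data validation at entry time.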

Type: post
Stage: unknown
Date: 2017-08-24