5. Data analysis

Analysing your data

Generate the figures, tables and other statistical results from your analysis data files using the appropriate methods and tools. The results consist of all the findings you report in your research project that are based on computations performed on your analysis data files. They may be presented in various forms, including tables, figures and numerical values reported in the text of the research project.

As in the processing phase, composing command files that execute all the necessary procedures is important.

Write code that generates all the results you report in your research project

  • One or more command files should contain commands that open up your analysis data files and then use that data to generate the output upon which your results are based.
  • Every command that generates any of your results should be preceded by a comment that states which result the command generates.

Example comments

* The following command generates the first column of Table 6.

* The following command generates the second column of Table 6.

* The following command generates Figure 4.

/*  The following command generates the correlation of 0.31 between the variables INC (individual annual income, reported in the natural log of current US dollars) and SATIS (individual subjective self-report of overall satisfaction with life, on a scale of 0—least satisfied—to 10—most satisfied). This correlation is reported on page 27 of the research project.  */

The command files for your analysis phase should not contain any commands that generate new variables or process your data in any way. All the procedures required to prepare your data for analysis should be executed by the command files you wrote for the process phase.

It is often convenient to write all the commands for the analysis phase in a single command file. However, if you think it would make sense, divide the code that generates the results into two or more command files. No matter how you organise your analysis command files, your Readme file will include an explanation of how to use them to reproduce your results.

important

Save the command files you write for the analysis phase in the Command files folder.

Analysis data folder

The fully cleaned and processed data files that you use to generate the results reported in your research project are called analysis data files. They are typically constructed by cleaning, processing and combining data extracted from one or more original data files.

A copy of every analysis data file used for a study should be preserved in the Analysis data folder.

Use a single cleaned and processed data file to generate all the results reported in the research project. This ensures that just one analysis data file needs to be stored in the Analysis data folder. When results are generated from more than one analysis data file, they should all be included.

Licence

Icon for the Creative Commons Attribution-NonCommercial 4.0 International License

Document your research data Copyright © 2023 by The University of Queensland is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, except where otherwise noted.

Share This Book