Research Experience for Teachers

Description of the Scientific Process

The following document is intended to be a guide for high school students who wish to conduct original research, especially for science fair projects. It is broken up into brief sections for the website, but an Acrobat Reader file (.pdf) is also available. The Guide for Using Excel for Statistics and Charts and the Statistical Examples in Excel are resources to help students use Excel to analyze and present their data.

Introduction
Description of the Scientific Process
John Stewart, Robert Burnap, Julie Angle, and Andrew Doust
The textbook description of the steps of the scientific process seems straightforward: observe, hypothesize, experiment, analyze, theorize, and communicate, but every one of those steps is fraught with misconceptions and difficulties, and science doesn't always follow those steps as neatly as we would like. What experiment is the paleobotanist doing when she digs up a plant fossil and analyzes it? It's not exactly a controlled experiment, but it is science nonetheless. She can generate perfectly valid theories and communicate those to her peers. Still, the textbook guidelines for scientific work are valid and provide a great framework for your own science fair project.
In this document, we will explore the steps of scientific processes in a practical way. We will give you advice about what to do and warn you of common pitfalls along the way. We will show you examples of success and failure—and really, failure is often the best, if cruel, teacher. We will show you alternative approaches that you may take. While this guide is laid out in the order in which you will be doing your work, everything is connected, and you should read the whole thing before you get started doing your research. Well, at least read ahead a section or two, since it might help.
We are assuming that you will be doing original research and not just mixing baking soda, vinegar, and food coloring to make a "volcano." Original research is harder than just showing off some well-known scientific principle. You have to think independently and creatively. Possibly the very best thing about original science, though, is that when you do it, you become the first person to know a piece of the truth about the universe that no one else has ever known. Then, you get to tell the world about your discoveries. Trust us—that's rewarding. It's like delicious lemonade after an afternoon of yard work but way better. In these pages, we will treat you as someone who wants to achieve that. The language will be informal and (I hope) warm. We will avoid getting overly technical when possible, but we don't want to talk down to you. If you have decided to do a science fair project, we truly respect you and your efforts. Now, get cracking.
Before you dig into this document, you should consider who “we” are in the text. We can’t help but to let our backgrounds affect our examples and outlooks.
John Stewart: I’m the primary writer, and I’m a plant guy and a genetics guy. Maybe you’re a physics or geology person. That’s fine and great. Just know that my examples might skew toward the biological and botanical. I don’t want to exclude your interests, but the examples that I include are just the ones that came to mind and that I am more excited about. Try to think about examples from your field of interest when you see what I write. It’s a great exercise, and if you can’t think of one straight away, it’s a good chance for you to brainstorm with your peers.
Andrew Doust: I am also a plant guy and geneticist, but with a strong interest in the genes that control development of organisms. In particular I am interested in the genes that underlie human domestication of crops such as wheat, rice, and corn. We will try and include examples from each of our areas of expertise and interest so that you get a broader range of views on what science is and what interesting projects you might tackle.
Julie Angle: I am a science educator. As a high school science teacher I taught chemistry and biology, and mentored students in science research and coached students to regional, state, and international science and engineering fair competitions. Now that I teach at the college level I am interested in how to better prepare individuals to become excellent science teachers and help them become proficient in the three constructs of science literacy.
Rob Burnap: I am interested in basic molecular aspects of how cells function and most of my work involves the study of photosynthesis. We use model bacteria, known as cyanobacteria, that possess essentially the same mechanism of photosynthesis found in plants and algae, but because of the genetic simplicity of these bacteria, we are able to detailed molecular mechanisms to test our hypotheses on how photosynthesis works and how it is used to drive the growth of the organism.
Observation and Finding a Problem to Study
The first step in doing an experiment is finding your system[1]. Thinking of something specific to study that is interesting to you is often the most difficult part of the process, so you shouldn’t feel like a failure at this step. (Failure comes later.) You need to ask yourself some questions[2].
First, what are you interested in? Do you like a particular field of science the most? This question won’t get you to your experiment, but it will narrow the list of possible experiments down. If you happen to come up with an experiment that is outside of what you think of as your favorite kind of science, though, that’s fine. One of the best things about being a scientist is that our work constantly changes. Let’s say you like biology. That is the study of life, all life. That might be a bit too broad, so you will need to narrow it down. With something like biology, you can do that several ways. You can break it down by the kinds of life you want to study (by taxon, as a biologist would say.) That could lead you to zoology (animals), botany (plants), microbiology (bacteria and other small things), or mycology (fungi). Alternatively, you can break it down by approach: genetics, biochemistry, physiology, anatomy, etc. Preferably, you should do both. Alternatively, you can choose based on application. Consider, for example, cancer research, toxicology, or robotics. You may be limited to whatever your materials allow you to do, and you should accept that as a positive thing, since it narrows your choices.
A lot of scientific research is performed on model systems. These are well-studied systems that are well-understood, so scientists can control for pitfalls and more easily reference other work. In biology, they are very common, as there are a number of model organisms. While humans can be considered a model organism, it’s hard to design and fill out the paperwork for ethical experiments with people, so experiments are done with Pan troglodytes (chimpanzee) and Macaca mulatta (rhesus macaque). Others include Drosophila melanogaster (fruit fly), Arabidopsis thaliana (a weed), and Escherichia coli (a kind of bacteria), Dictyostelium discoideum (a slime mold), and Caenorhabditis elegans (a tiny worm).
In human research, HeLa lines (human cells derived from one woman’s cancer in 1951—it’s a fascinating and compelling story that you should look into) are frequently used, since they have human genes.
That still doesn’t necessarily give you a project, so here’s another bit of advice: think locally. If you want to do original research for your science fair project, think about what’s immediately around you. You’ll have ready access to local problems. The odds that someone has studied a local kind of rock, organism, or whatever are smaller than if you think about something everyone already knows is big and important. Local quirks about wildlife and geology are often very interesting but unexplored.
Speaking of local, consider what other people have told you. Maybe a friend of your parents mentioned that something was funny about the soil near his home. That might be something to look at. Likewise, you can ask people you know what they think would be interesting to study. Most folks will look at you blankly, but you might get a few people who give you the start of an idea that you can follow.
Instead of asking non-experts, you can look at what the experts say. While a lot of real scientific research is behind a pay wall (paying $25 to $50 for access to a 10-page paper is ridiculous but common) enough of it is available to you for free that you can skim through it. You will be looking through titles that seem really technical or really vague, but you can read the abstracts. When you do so, you are looking for experiments that were done on systems similar to the ones you’re looking at. A scientist did a study on wheat growth relative to clay in the soil.
Why not repeat that with a local weed[3]? Look at the scientist’s methodology and repeat it for your weed.
John’s anecdote: Reading abstracts is something that scientists do all the time. I once had a boss who described his reading of papers as follows:
1. Read the title. If it interests you, then…
2. Read the abstract. It’s a summary of why the work was done, what was done, what was discovered, and why that’s important. (We’ll talk about writing abstracts when we talk about writing in general.) If the abstract interest you, then…
3. Look at the figures and tables. If you want to understand what’s going on, then…
4. Read the discussion and reference the results. If you still want background, or you need to know how things were done, then read the introduction and the materials and methods.
You could just start by reading the whole paper, but this is more efficient when you’re just exploring the literature.
One of the most important things to think about at all stages of the scientific process is variation. At this point, think about how your subject might vary against something else that might vary. In the simplest case, you will have some measure that can vary from subject to subject, i.e., the height of a plant, and you will have another measure that you can control, i.e., the amount of water the plant receives. We call the first measure the dependent variable and the second measure the independent variable. You may have several of each of these, and we’ll talk about how to manage those when we talk about experimental design. Right now, you need to think about what different things might vary.
What if you want to examine something that you aren’t controlling? Consider a lake and the soils surrounding a lake. You might find that the soils vary in consistency or type (soil science is a big and complicated field; you can find a lot of job opportunities in soil science, by the way) and that the distance from the lake has a lot to do with that. The soil type would be your dependent variable, and the distance from the lake would be your independent variable. In another experiment on wild plants and animals, both distance from the lake and soil type would be considered independent variables. We call an experiment like this a natural experiment, and it’s something you see frequently in medicine, geology, ecology, and even in social sciences like economics. Natural experiments are often more difficult to perform and noisier[4], but they can shed light on problems that cannot be examined with traditional experiments.
So, you’ve found something you want to test. What next? Can you just jump into your experiment? No! You need to create a hypothesis. The relevant Oxford English Dictionary definition of the hypothesis is, “A supposition or conjecture put forth to account for known facts; esp. in the sciences, a provisional supposition from which to draw conclusions that shall be in accordance with known facts, and which serves as a starting-point for further investigation by which it may be proved or disproved and the true theory arrived at.” But that’s stuffy English professor speech. Here’s what you need to know. Your hypothesis is a reasonable guess of what might be going on with your system that can be tested with an experiment. “I hypothesize that the plant species Taraxacum officinale (common dandelion) will have less biomass when it has less available water,” is a reasonable, if boring, hypothesis. Other examples:
- I hypothesize that plant or microbial communities differ in sites with different amounts of sedimentary rock.
- I hypothesize that the seed of Acer negundo (boxelder) will fall more quickly than the seed of Acer rubrum (red maple.)
- I hypothesize that farmers will be more accepting of controlled burns than city residents will be.
- I hypothesize that beef fat content will decrease in cattle raised in drought conditions relative to normal climatic conditions.
- I hypothesize that the traffic capacity of a local avenue will decrease after a new traffic light is added.
- I hypothesize that different quantities of water and sodium citrate will be needed to emulsify different kinds of cheeses.
- I hypothesize that molded ABS plastic will have higher flexural strength than 3D printed ABS plastic.
- I hypothesize that mutant cyanobacterial cells with defects in their light-capturing antennae will require more light to achieve the same growth rate as normal cells.
So, you’ve got your hypothesis. What now?
[1] The word system will be used throughout this paper. It’s kind of vague, but basically it is the thing that you are studying. It could be the lake in the park, the drought response of Capsella bursa-pastoris (shepherd’s purse), or how four-year-olds respond to the prisoner’s dilemma. It’s a very general term for the thing you are studying.
[2] Note from Andrew: I would say that it is important to think of many possibilities, some of which are certain not to work when you really start to look at them closely!
[3] Don’t do experiments on illegal things here. This is a science fair project, so your research will be publically known.
[4] Noise is a term that scientists use to refer to variations in the data that are not meaningful. It’s like static on an analog radio set. Scientists often refer to separating the signal from the noise. The signal is the meaningful information that you can find in variation. It’s like the music on a radio station. We will talk more about how you can use statistics to separate the signal from the noise in a separate part of this document.
Designing Your Experiment
To test your hypothesis, you need to come up with an experiment. This is when you get to decide what work you will actually do. You need to start by taking your hypothesis and the independent and dependent variables that you think that you can test and figuring out how you are going to test them.
Materials and methods sections of papers are really boring, and even scientists tend to skim over them. We do use them to figure out what we’re going to do, though. I may want to use someone else’s methodology on my system. Here is some advice for how to digest them:
- Materials and methods are densely written. Separately, write down what is relevant so that you can write it down and adapt it into a protocol.
- Authors often reference other articles when they describe what they are doing.You may have to go and find them to replicate the methods.
- Know what units the authors are talking about. You’ll see a lot of nm’s, μL’s, and ω’s in papers. If you don’t know, then look them up (the funny symbols are Greek, typically). If you check on Wikipedia, you can find out all the different ways they are used in science.
In a traditional controlled experiment, you will need what is called the control sample and the treatment sample, or experimentalsample, (maybe samples.) Let’s start with the word sample. It refers to a set of specimens, measurements, etc. that you will group together for repetition. The number of those things is called the sample size, and we will address that soon. Your control sample is the sample that is under somewhat normal conditions. The term “normal” is loaded, but the important thing is that you know what “normal” means for your experiment. Scientists argue and fuss about how valid controls are all the time, but when you feel satisfied that you have a set of normal conditions, then you should proceed. The treatment samples are those with conditions that somehow vary from your control in defined ways. For example, you water your control dandelions with 100 mL of water every day, you water your drought treatment dandelions with 50 mL of water every third day, and you keep your waterlogged dandelions semi-submerged. Here, you are varying water exposure relative to your control (you are also varying water volume and watering frequency, so you may have to adjust for that.)
It’s important that you consider what might be realistic for your treatments. What is a reasonable range of the variable? I’d bet that those waterlogged dandelions aren’t going to get very far, and it might not be a good level of exposure, since they will likely die within a few days. You have to consider this issue for your own experiment. Here is where you can dive into the literature. Scientific papers have to include their materials and methods, that is what things they used and how they used them. Check those out.
How do you find out what treatment extremes you could use? First, you should look at what others have
John’s anecdote: I once performed a study testing the rate of hybridization of shortleaf pine and loblolly pine in a shortleaf pine forest adjacent to a loblolly pine plantation. As expected, there were many hybrids in the sample site across a road from the plantation, but all of the other sites that were staggered away from the site had few hybrids. As it turned out, my results were not great, and I could only make weak claims in my paper. The lesson is that you should have a good idea of what scale your treatments act over. If my sample sites were all closer to the plantation, I may have been able to detect the distance that loblolly pine pollen was fertilizing the shortleaf pine cones.
Andrew’s note: I LOVE pilot experiments! You can be wacky with them, really testing out extremes. If you don’t you may find yourself in the position of my first grad student who did lots of experiments being too small to get really interesting results. Of course, what other scientists have already published can be a pilot experiment for you as well, to help you understand the responsiveness of the study system.
done in similar experiments. Yes, that means you have to look at the literature again, but if you used any scientific literature to get to the point you are now, then that same literature will probably have what you need. You can also ask an expert. Any scientist worth his or her salt would be delighted to correspond with you about setting up an experiment, and really, the biggest danger here is that they make your scope of study way too big to handle. Remember that what they tell you is advice and not a dictate. Finally, you can try a pilot experiment.
Pilot experiments are miniature trials that you can use to test your methods. (Professionals sometimes use them to justify grant requests.) If you have the time and resources, then you can try running your experiment with a lot of treatments to work the kinks out. In addition to informing you about extremes of treatment, pilot experiments can help you work out bugs in an apparatus you built or show you that you need to take extra care in some step.
Consider our dandelion example. Let’s say that you have already decided to put your plants into 2” × 2” × 3” pots. Now, you need to know how much water you should add to get reasonable plant growth. You will just want a range of watering conditions repeated two or three times to get a feel for their effects. Those conditions might be 2 mL/day, 5 mL/day, 25 mL/day, 50 mL/day, 100 mL/day, and 250 mL/day. Notice that these conditions do not scale in a linear way. That is, they are not in increments of 10 mL or something. This way, you can capture a better representation of varying growth conditions with only a few samples. You might also only grow your plants for 2 weeks in the pilot experiment instead of the full 4 weeks you would use in a planned experiment.
You will also need to visualize how your experiments will work and where they might introduce accidental experimental bias. Consider a flat of daisies in an experiment. It’s placed on a windowsill on the east facing side of a house. In the morning, the flat gets sun, so all seems well. You then notice that the easternmost daisies are growing faster, and you realize that they are shading out the other daisies. More than likely, this experiment has been ruined by an accidental bias introduced in the design. Even if the flat of daisies was placed under a sunlamp so that all of the plants got even light, you would notice that the plants around the edges are growing better, since they are less crowded. We have to ignore our edge plants and only collect data for the others. If you had designed the experiment so that all of your treatments were in neat rows, and your control plants had all been on the edge, you just lost your control samples. Ouch!
Besides edge effects, there are a great many things that can add bias to your results. If you need to test multiple samples with a device like a probe, each exposure could change the probe’s sensitivity. Again, if you measured all of your control samples first, and then measured all of your treatment samples, you might introduce a bias into your experiment through the order of sampling. Social scientists frequently use surveys and have to be very mindful of the wording of the questions they use. A given political affiliation or racial group might respond very differently from another just because some word with a loaded history was included in the question.
What do you do about this? First, randomize. Randomize the order of the plants in the flat or the order in time of the samples you measure with a probe. You can do this in a spreadsheet[1] or with a deck of playing cards. Second, include dummy samples. On your flat of daisies, just grow daisies around the edge that are not considered to be treatment or control plants. You can then ignore them in the analysis. If you are doing some sort of chemical assay on them, then they can act as practice specimens later. Be creative!
Sample size is the bane of scientific studies. We don’t know how many times we’ve seen a scientist say that their sample size was too small. They couldn’t end up making statistical claims that they wanted to make about their studies, because they couldn’t find enough smallmouth bass, or the small proportion of sample locations with rose rocks is too small to make meaningful conclusions. Sample size issues can be devilishly frustrating to deal with, but you can overcome them.
First, how large should your sample size be? It depends on how much variation you expect to see in your dependent variable and on how many samples you think you will have to discard as the experiment progresses. If you don’t know those things, you can again ask an expert, read the literature, or attempt a pilot experiment. When asked how large a sample size should be, most scientists will tell you that it should be as large as possible, and that’s true. There are trade-offs, though. Given limited resources, you can have more treatments with smaller sample sizes or fewer treatments with larger ones. If you can manage 100 specimens, you could make a control and a treatment with samples sizes of 50, or you could make a control and 3 treatments with samples sizes of 25. Which should you do?
If you have no prior information and just want to proceed with the experiment, going with 40 individuals is not a terrible rule of thumb[2]. You can often get good statistical significance with 30 individuals, but you must assume that you will lose some during the experimental process. If you can’t get that many, do not despair. If the differences among treatments and controls are large enough, you may still get great results with only 4 samples, though that is very risky.
Finally, you’ve designed your experiment. Now, ask yourself once more, how will this experiment test my hypothesis? If you aren’t sure, either redesign your experiment, or reword your hypothesis. Both approaches can be valid. Sometimes, you lose your way somewhere in the experimental design process, but sometimes, your original hypothesis was just too hard to test with the experimental resources you have. Do what you feel is best. It’s your project.
[1] See our Excel Tutorial.
[2] The website http://edis.ifas.ufl.edu/pd006 has a decent guide to finding out how big your sample should be.
Running Your Experiment
So, you have designed your experiment and gathered your materials. You are ready to begin. When you run your experiment, the most important thing to do is to be consistent. If you used Brand X potting soil for the control plants, then use Brand X potting soil for the treatments (unless the experiment was testing the quality of Brand X versus Brand Y potting soil, of course.) Apply even conditions across all of your samples as best as you can to avoid introducing experimental bias. It seems really simple, but as the experiment proceeds, it can be all too easy to get sloppy and unevenly apply some condition. Likewise, make your measurements consistent. If you use a Brand X pH probe for some of your samples, use it for the rest. Define how you will dissect each insect to measure abdomen length and do it the same way each time. Doing things unevenly will make your results highly variable and unreliable.
One way to stay consistent, especially in multi-day experiments, is to take notes. Get yourself a nice notebook dedicated to your research project. In it, write down the times of what you did, exactly what you did, how much of a chemical you used, measurements you took, funny things you observe about your samples, and so on. We don’t think we’ve ever heard of anyone putting too many things in a scientific notebook. Do not underestimate the utility of the notebook. You can later use it to spot accidental experimental bias that you introduced or recreate part of your experiment for the sake of consistency. When you are creating your posters or papers for your project, you will probably find something you wrote down in your notebook important. Finally, the notebook is an artifact and a keepsake of your project.
Sometimes, mistakes happen that threaten to end the experiment prematurely. When that happens, step back and take a deep breath. (Go and punch a pillow and yell, too. It does help.) When you have gathered your wits and achieved a Zen-like state, consider what you can do to salvage the experiment. If you added twice as much hydrochloric acid to half of your samples, you might just redesign your experiment on the fly. You’ve been taking notes, right? You can use those to describe some of your samples as being new treatments. Just write it down and carry on. Who knows? Maybe you’ll find something interesting. If it’s early enough in the experiment, or you can afford more research, you can use much of the material from a failure, and you can certainly use the lessons so hard-won.
During an experiment, it is often critical to label your samples. If you need to redo something or just prevent yourself from double-treating and individual, labels are really important. When you take notes, you can report exactly which individual or data point was affected by a mistake. Remember to upkeep those labels, too. Organic solvents and sunlight can erase your markings, and tape can peel off. If you are working with outdoor labels, avoid permanent markers (which aren’t so permanent in the sun) and just go with pencils. Better yet, use stamped metal labels.
And those mistakes will happen. In science, the occasional spill, drop, or contamination will occur. You just have to carry on. What do you do with the would-be data points? Most often, you remove them. Keeping them around could add bias to your results. On occasion, if enough sample is destroyed, you may have to abandon the experiment, but that’s tragic. When you designed your experiment, you should have made the sample size large enough to account for data loss, so most removals should not impact your study too much. Do not be too cavalier about this aspect of running your experiment, but take some comfort in having accounted for the possibility.
When you have completed your experiment, keep as many of the materials as you can. You might find unusual results later that are unaccounted for, and you can double check that there was not something physically wrong. Your peers may also be interested in exactly what you did. For science fairs especially, artifacts of your research can be very useful for your presentation.
Interpretting Your Results
Interpreting Results
You have done your statistical analysis, and you can show that two things are likely related (or not.) You just did the mechanical part, the boring part, the thing that does not yet have the inspiration of human spirit. Here you are now, results in hand. What in the world do you do now? You have to interpret what you have found. You have to take the results and tell yourself and others what it means, what the implications are, and how this changes our understanding of the world.
Statistical Significance
If you performed a test that told you that two or more means were statistically different, then you need to explain why you think that they are. Were you expecting them to be different? Were you expecting them to be the same? How different are they[1]? If your tests did not show statistically significant differences, then what do you make of them?
These questions can dog professional scientists, and honestly, there is no hard and fast rule that you can use. If you have a p-value of 0.049 (just under the 0.05 threshold), then you have shown statistically significant differences according to conventions, but you have barely done so. When you interpret that difference, you need to acknowledge that it is a weak one. Perhaps, more replications of your experiment will clear this problem up, but you may not have the time or resources to do more experimental work.
The bottom line is that you can make a claim that your mean or proportion is different with statistical significance. You will have to interpret what that means or implies. Be sure to relate what you found with what others have found in similar systems or with other measures in your system. If you found that the selenite crystals in a local park have greater specific gravity than is normal, you will need to speculate and perhaps do follow-up chemical tests to determine why it is so different.
Correlations
If you did correlation tests, then you will need to interpret what those correlations mean and suggest what other tests need to be considered. In general, correlations show that two measurable things vary with each other or against each other (or not at all.) You have all heard the phrase, “Correlation isn’t the same as causation,” or something to that effect. This statement is true, but correlation can imply causation.
When two things (A and B) correlate with each other, we can come away with four different conclusions: A causes B; B causes A; something other than A or B simultaneously causes both of them; or A and B correlate due to coincidence. Scientists are not interested in the last conclusion, except that they need to rule it out. Let’s start with the first two conclusions, A causes B and B causes A. You will have to use reason and common sense, along with whatever evidence you can find, to tease out which is true[2].
Figure 1 : Recent Oklahoma earthquake frequency.
Consider Oklahoma’s recent spate of earthquakes. Starting in 2009, the frequency of earthquakes with a magnitude of 3.0 or higher has gone up dramatically (Figure 12). As many have observed, this date is the approximate date that waste injection well use in fossil fuel extraction also increased[3]. The two things correlate. Let’s compare A causes B to B causes A. Would it be reasonable to say that injection wells could cause earthquakes? Waste injection wells push liquid deep underground at high pressure, which geologists surmise could alter the state of underground faults, thereby initiating earthquakes. Could we suppose that earthquakes cause injection well use? That would be absurd, of course. No one feels the ground shake and then decide to go and drill because of that.
Could something else cause both the recent increase in earthquakes and increased waste well injection? That seems almost as absurd as earthquakes causing drilling, so we should rule that out. The final possibility is that these two things are mere coincidence. That is not as absurd, and it should be seriously considered. The joint study performed by the United States Geological Survey and the Oklahoma Geological Survey concluded that it was unlikely to be coincidence after more rigorous statistical tests and the use of information from other locations, but you should always consider the possibility that correlations are coincidence.
Frequently, two observed trends will correlate, because some other factor is driving them. Sour foods can lead to tooth decay, if you’re not careful. How does sour hurt your teeth? Well, you taste acidic foods as being sour, and acids can wear away at tooth enamel. Both the sour flavor and the tooth decay are caused by acidity. Let’s try another. Sometimes, silly hats and fireworks co-occur[4]. While silly hats could lead to fireworks, it might just be that it’s New Year’s Eve. People like to wear silly hats on the holiday, and people like to display fireworks on certain ones, so both silly hat wearing and fireworks are caused by people’s reaction to New Years Day. Okay, one more. Alligators and cypress trees correlate. Does one cause the other? No, they both prefer swamps. When you look at your data, make sure that you find instances of these things.
Figure 13: Correlation sometimes has nothing to do with causation. Say what you will about Microsoft Internet Explorer, but it would be unreasonable to assume that it caused murders or that few murders caused people to stop using the software. Source: Gizmodo.com
Now and then, correlation is total happenstance. Two things have positive or negative correlations, and it has absolutely nothing to do with them relating to each other. Sometimes, it’s really easy to tell when that is true (Figure 13)[5]. As with finding common causes for correlating variables, uncovering when they are simply happenstance is also an exercise in common sense and reason. If you cannot explain why two things are related, then you should assume that they are not. In fact, designing experiments to test whether two things are actually related and possibly why is a great enterprise in-and-of-itself.
Context
Science is not conducted in a vacuum. Your results relate to someone else’s work, and you worked on something that interacts with something else. You need to put your results in the context of those things. Here, your research of the systems and methods you used in your experiment will pay further dividends. What did other authors say about their systems? What did they say about the results they got? Borrow[6] those ideas and make them your own.
There is not any one-size-fits-all advice for finding the context of your work. It is generally a matter of knowing why you researched, hypothesized, experimented, and analyzed. Here are some tidbits that we can provide you, though:
- If another experimenter used the same methods that you did on another similar system, and you share results, what did he or she conclude? Do you think that you can make similar claims?
- What have other researchers discovered as unique about the system that you are working on? Could they be relevant? Why or why not[7]?
- How will this work impact what we understand about other systems related to your system?
- Is there an impact on people[8]?
- Does anything make you wonder what is really happening? You could have found something unusual that you can encourage others to look into.
When science is done, you test a hypothesis to challenge a theory. A theory’s resistance to those challenges is what makes it strong. Sometimes, when you have done your work, and you found no differences between treatments like you thought you should, you are tempted to despair. The lack of differences is called “negative results,” and even the wording implies failure. You have to put that into context, too, however. Do those negative results support or challenge the theory? Maybe they imply that the theory is not as strong as others think it to be. Depending on how well supported that theory is, you may have shown that it does not apply to your system, to a class of systems, or to anything at all. After all, if only one experiment in all of history has supported the theory, you may have just cast enough doubt on it to disprove it. On the other hand, if it has been supported by millions of prior experiments and has been shown to explain phenomena time and time again, it’s probably your experiment.
Negative results are difficult to make conclusions from, but they are an underappreciated part of science. It’s hard to get them published in the more prestigious journals, and many scientists don’t bother submitting papers about them to even the least of journals[9]. That’s a shame, really. If nothing else, it prevents other scientists from trying the same experiment when they could be doing something else. As a high school student, you should be proud of your results, even if they are considered negative. You might have discovered something new anyway.
[1] Since you are working on a high school student’s budget, if your results are significantly different, then they are probably meaningfully different. In studies with many replications and huge sample sizes, statistical differences can be found that are so minor in real world terms that they don’t really imply any worthwhile conclusion. This is most common in gigantic medical studies.
[2] You know, both could be simultaneously true. This would be an instance of feedback. Consider one plant shading another plant. They both need sunlight to grow, and the bigger plant is getting more of it. Because of that, it can grow faster, continuing to shade the smaller shaded plant. Thus, the shaded plant’s small size is preventing it from getting sunlight (small size is causing the plant to be shaded by larger plants), and the shade is stunting growth (shade is causing the small size.) Plants have means of escaping these situations, of course, but this might be true for some plants some of the time.
[3] Some modern fossil fuel extraction has a lot of waste water that needs to be disposed of. Drillers simply inject it deep underground to prevent it from harming the environment.
[4] Scientists often use the words “occur” and “co-occur” in ways that are kind of funny to non-scientists. You might say that wolves live in the forest, but a scientist would write in a paper that they occur there. If two things happen together or live together, we will say that they co-occur. This often implies correlation.
[5] Here are lots of spurious correlations that you can waste your time justifying.
[6] We say borrow and make your own, and that’s important for two reasons. If you want to be good at what you are doing, you have to internalize the ideas you are using and coming up with. This is a matter of practice and talking to other people. When you have aspects of your work internalized, you can really shine. Also, it will help you to avoid plagiarism when you write. Take note of where you got ideas, but also having them as your own ideas will help you to write original text, and that is the essence of avoiding plagiarism.
[7] Generally, if something isn’t relevant, you don’t need to report that it is. There are cases in which people have assumed something to be relevant, and research showed that it was not. Maybe you found something like that. If so, you’ll want to report it.
[8] Not all science is immediately relevant to human welfare, and that’s okay. If you can make a claim that yours is, though, it tends to get folks’ attention. As a dispassionate scientist, you shouldn’t care about that, but as a dispassionate scientist who needs to get a grant proposal accepted in order to get a paycheck, you probably should care. Just saying.
[9] Peer-reviewed scientific journals represent the most important way that scientists communicate their results. They are basically magazines (more online entities these days) that contain articles describing what scientists did and what they found. Each article has been examined by other experts in the field and vetted for its rigor and importance. Very prestigious journals such as Nature, Science, and Proceedings of the National Academy of Sciences have a very high bar for what they consider important. Every field has its own set of journals ranging from very prestigious to those that accept any paper, as long as the science was rigorous enough.
Communicating Your Results
Communicating Your Results
The final step in science is communicating your findings. While the public thinks of scientists as people who work in labs and who write confounding equations on chalk boards, scientists are also writers and speakers. You can do all of the lab work, field work, observations, calculations, research, and analysis you want, but if you never get your work out for, you may as well never have done your research. Science fairs are excellent ways to practice this aspect of science, and we hope that you enjoy communicating your results. We will discuss the primary ways that scientists communicate: with writing, with posters, and with presentations.
Writing
Good writing is a skill that you can use in many walks of life, and even if you never really use it professionally, you can use it personally and even romantically! How can science writing be romantic? Mostly, it can’t, but writing is writing, and being good at one kind can help you be good at other kinds. In your English classes, you should be learning all of the rules of grammar, punctuation, and spelling. Those are reasonably universal, and even when you write poetry or the sloppy slang of your fictional cowboy, it really helps to know what rules you are actually breaking (since you can break them better that way.) Just developing the swagger to put words onto your page is really helpful. Learn how to write, as it is a great skill.
Before digging into the nitty-gritty of science writing in particular, it is worthwhile to relay some advice to you about writing in general. We’re not going rehash, subjects, predicates, gerunds, and commas, but you should remember all of those rules. It saves time in the editing process, and it makes you look smart. Having graded some college papers, we can tell you that students make very different kinds of mistakes when they write. While one student will frequently use incomplete sentences, another will have run-on sentence after run-on sentence. You have to know your strengths and your weaknesses. Remedy those grammatical weaknesses, and you will save yourself a lot of grief.
As is true with many things, practice is necessary to develop as a good writer. While you may only produce a few papers for your classes every semester, you should consider come other ways to practice.
- Write for fun and profit. Make up short stories, poems, or opinion articles. Even if they’re not good, it’s still practice. Maintain a blog or a gallery of your work. As you write, you will understand writing more, and if you do it for fun and not in fear of teacher’s red pen, you can better associate writing with pleasure.
- Use good writing in text messages and on social media. Instead of abbreviating everything and not caring how your sentence comes off, make a solid effort to use proper grammar, spelling, and punctuation with your peers. Your generation writes a prodigious amount of text, so you have no excuses for writing it poorly. If your friends make fun of you for doing so, just tease them for their bad spelling and move on. Writing well should be a habit.
- Read. Read books, magazine articles, and blog posts. Read fiction, poetry, science, and history. When you read, have opinions not just about the content of what you are reading but the quality of the authorship. Copy styles you like and understand why you don’t like what seems bad to you.
- Read a few writing guides. John’s very favorite is “Politics and the English Language” by George Orwell, but there are lots of others out there. The National Center of Biotechnological Information (NCBI) also has a guide to writing scientific papers.
When you write, you create a voice. Just as it is with a person’s audible voice, there are lots of ways to describe a written voice[1]. You can also write in different voices depending on your audience and your intent. When we write scientific papers, we follow a very different style and present a very different voice than what we are writing with here. In this work, we are trying to be familiar and warm. We are avoiding too many unnecessary technical terms without sounding condescending. We can use the words “we” and “you” with abandon. Scientific papers are very different. In those, we have to be precise and very detailed. The writing is colder, but that’s okay, because the purpose and the audience are different. It is more important for a scientific reader to very clearly understand exactly what we mean by something, so there is more jargon. The priorities are different, and you have to appreciate the different motives of the reader.
Here are some scientific papers that will be relatively easy to read based on their content and style:
- Curley LP, Hoseney RC. 1984. Effects of corn sweeteners on cookie quality. Cereal Chemistry 61: 274-278.
- Eide ER, Showalter MH. 2012. Sleep and student achievement. Eastern Economics Journal 38: 512-524.
- Cvetkovic C, Raman R, Chan V, Williams BJ, Tolish M, Bajaj P, Sakar MS, Asada HH, Saif MTA, Bashir R. 2014. Three-dimensionally printed biological machines powered by skeletal muscle. Proceedings of the National Academy of Sciences 111: doi:10.1073/pnas.1401577111.
- Lichti NI, Steele MA, Zhang H, Swihart RK. 2014. Mast species composition alters seed fate in North America rodent-dispersed hardwoods. Ecology 95: 1746-1758.
Please note that some of these are likely behind a paywall. To get ahold of them, you may need to contact a local college. Your teacher may be able to help.
Let’s get down to brass tacks in discussing what you need to do when you write a scientific article. Overall, you need to be precise, explanatory, and concise. It needs to be clear what you did, why you did it, and what you found out. The whole point of the exercise is to explain all of the little details, but you should also be fairly brief. Like I said it takes practice. We recommend reading some good scientific articles to learn about this.
The standard scientific paper goes something like this: title, abstract, introduction, materials and methods, results, discussion, acknowledgements, and literature cited. Most papers aren’t actually written in that order, though. Instead of going over them in order of when you see them in a journal or in your final paper, we will go over them in order that you should write them.
Start with the introduction. Even though it’s not the first thing you see when you look at a paper, it’s very easy to write it first. The introduction should cover some basic information about the system you are working on, what sorts of related things that others have done with your system, what sort of problem you are addressing, and a hint at what you found. For a science fair project (or most science writing in general) the introduction should be short. Most are 4 to 8 paragraphs long. You can write an early draft of your introduction before you begin your experiment and use it as a personal guide for how you will think about your project.
Next, write the materials and methods section. Herein, you will need to describe what kinds of samples you are working with and where you got them. You will need to describe the conditions you used, the ways that you did your experiment, and any special methods of analysis you used. Materials and methods sections are unusual in that they are often written in the passive voice. This is because you are the one who did the things to the samples, and a lot of scientists avoid using first person writing when they write anything, but this is changing. Either, “50 mL of water was added to the solution, and pH was taken again,” or, “We added 50 mL of water to the solution and remeasured pH,” are usually acceptable. Check with any guidelines about this for your science fair before you continue.
For the results section, you will need to include numbers, observation, and so on. You will need to include information about statistical significance, if that is important, as well as brief interpretations. You will likely reference your most important figures and tables in the results section as well. Keep your sentences simple, since you are only stating fact here and not in depth analysis. Like materials and methods, results sections are often broken down into subsections describing the results of different tests. Give each one a heading.
Many papers include a conclusion section. Sometimes, it is required by the journal. A conclusion is typically 1 to 3 paragraphs that distill the most important points raised in the discussion. If you include one, keep it brief and poignant.
The discussion section is the coup de grace of your paper. In the discussion, you get to tell your readers what this is really all about. You will take the most important results and explain why they matter, what they could really mean, and what sorts of questions they unveil for future investigation. You will need to defend your claims here, and since science is about disproving theories, you will need to suggest some potential problems and pitfalls relating to what you discovered. Then, you will need to explain why they might not be valid or relevant for your research. In the discussion section, you will need to put your neck out and really stand for your results.
There is a little section that you can write any time you like, really, but it usually follows the discussion called the acknowledgements section. This is usually a very brief paragraph that thanks any non-author who helped in gathering materials, running experiments, giving key advice, creating graphics, etc. If you received outside funding, you should explain who gave it to you, and provide an agreement number, if that is applicable.
Combined, the introduction, materials and methods, results, and discussion comprise the body of your paper. Now, it is time to squish them all together into one paragraph, the abstract. While this is one of the first things that a reader will encounter, it should be one of the last things you write. The essential gist of the abstract is that it explains what is important about the problem you are addressing, how you approached the problem, what the headline results were, and what their implications are. Here’s a basic approach to writing an abstract[2]:
Here are some titles from an issue of the Proceedings of the National Academy of Sciences (PNAS):
- Some inconvenient truths about biosignatures involving two chemical species on Earth-like exoplanets.
- Radiometric 81Kr dating identifies 120,000-year-old ice at Taylor Glacier, Antarctica.
- Switchable S = 1/2 and J = 1/2 Rashba bands in ferroelectric halide perovskites.
- Juvenile hormone regulates body size and perturbs insulin signaling in Drosophila.
- Field experiments of success-breeds-success dynamics.
Note that some of the titles are more accessible than others. It largely depends on whom the authors wish to reach.
Use one or two sentences to describe your system. Just highlight the aspects of the system that you are addressing.
1. Use one or two sentences to describe your methods. You don’t need to include specific numbers like how many mL of water you added to each sample, unless that’s what you were varying. You need to include kinds of materials you used and the ways you varied your methods.
2. State what your basic results are in 1 to 3 sentences. Only include headline numbers. If your results are more qualitative, then do not worry about the specific numbers.
3. State what this means to the field of study in one sentence. This final sentence encapsulates what it is you did, and it is the single most important sentence of your entire paper. What are your implications? What has this changed about how we understand the world? No pressure.
Every sentence of your abstract should be an active and bold sentence. It should be clear and concise. Save your uncertainties for the discussion section and distill everything down to its fundamentals.
You have one last little bit to write: the title. Most authors write this last, though if a pithy or really effective title comes to mind at any point during the process, write it down, and it could work. There are a lot of approaches to writing titles, too. Some of them simply state the finding in a sentence. Others describe what the authors were investigating. You will find the occasional title filled with puns. While there is a lot of flexibility, you should carefully examine what it is that you are trying to do. Puns and humor get attention, but do they get the target audience interested in your topic? If you give away what you found, will anyone be interested in reading the rest of your paper? You just have to figure out what is most appropriate and what you think best reflects your work.
Academic writing, and scientific writing by extension, requires literature citations[3]. Here’s the deal: in a scientific paper, when you make a claim, you have to back it up. There are three ways to do that: use pure reasoning, use your results, or cite literature. Pure reasoning is helpful, but it’s not actually that common in scientific papers, because science is largely evidence-based. You already know about reporting your results. Citing literature is just you saying, “Hey, this guy already showed that this is true, so I don’t have to totally rehash whatever it is he said.”
Literature citations serve several purposes. First, they show you did your homework. Second, they show that what you have to say is backed up by the scientific community at large, placing your own work into the larger scheme of science. Third, they pointreaders to other articles that they might be interested in. Consider when you were doing the literature research putting together your experiment or even writing the paper. You might have read a paper and seen their own literature citations, and those pointed you toward something else that is useful. You need to pass on the favor.
Every journal has a style for literature citations. In the text, there are two common ways to call out a citation. The most common is to use the author and date. You can make a claim and then finish it with “(Smith 2012),” “(Chu and Mustafa 2009),” or “(Talbot et al[4]. 1993).” If it makes a sentence work better, you can write something like, “Smith (2012) reported that…” That is an especially useful way to keep the active voice when you write. The other format that is occasionally called for is the use of numbers. Numbers between parentheses or in superscript[5] are used after phrases to indicate which reference in an enumerated list you should use. Keeping an active voice when using these or even talking about them can be difficult, but you can use something like, “A previous study134 reported that…” Note: if you have more than 100 references, you need to cut back.
The format you use for the literature cited section varies widely across journals, so you will need to look up what your science fair guide tells you to do. Each entry will include the authors’ last names and initials, the year of publication, the title, and the work it was published in. Note that journals, books, theses, conference proceedings, and so on will require different formats within the literature cited section. It can take some work getting them right, so be prepared for that extra time.
Most scientific works include maps, charts, photographs, drawings, and other images. Collectively, these are called figures, and they are really, really important. Even a serious reader of your paper will gravitate toward the figures first. Figures can be referenced in any part of the body of the text. If you are formatting your own work, you will need to keep the figure near the text that first references it, but your publisher may request you include all of your figures at the end of the document.
These days, maps are relatively easy to generate. You can grab screenshots from satellite images using software like Google Earth[6]. If you are mapping a study location, you can use drawing software to generate proportional lines and features. You can even use something like PowerPoint to create maps with its drawing tools! If you use a map, you need to justify why you are including it. Are you showing where your study sites are? Ask yourself if it’s important for the reader to see exactly where they are or how they are laid out. It might not be especially important for your conclusions. Sometimes, you can use maps to show results, and if you can, bully for you. Maps are a truly great way to relate results to the real world.
There are a lot of types of charts, and it can be a bewildering thing to figure out which charts you should use. Here is a list of the most common charts your will see with the reasons that you might want to use them:
- Bar/Column Chart. These charts include masses representing data from different categories so that the reader can visually compare magnitude. When it’s important to display how different treatments produced different (or similar) magnitudes in a response variable, you should use a bar or column chart. If you have statistical significance information, you can also include error bars that can indicate how different your responses are from each other.
- Line Chart. These charts typically show how a response variable changes over time or space. The x-axis needs to involve a variable that changes continuously for your sample. Too often, people use line charts when they should use bar/column charts or scatter plots.
  - Stacked Bar/Column Chart. This variation on theme includes multiple contributing factors to response variables. You might show increased vegetation for the treatment in general, but with a stacked bar/column chart, you can show which kinds of vegetation increased along with the overall trend in one figure.
- Scatter Plot. Use scatter plots when you intend to show how two variables relate to each other. Most often, scatter plots are used for correlations, in which case, you may want to include a trend line as well as an r value or R2 value.
- Pie Chart. Pie charts are used to show what proportions different categories make up of a whole. They are excellent for surveys of wildlife, soils, opinions, and such.
Many papers include photographs or drawings of specimens or apparatuses. If you want to include some, consider why you are doing so. Do you need to convey how you did an experiment, and words just won’t do the job? Are you describing the feature of interest some test subject and need to display a typical example? If you answered yes, then you should include a photograph or drawing. Keep in mind that most papers do not include them, and those that do use them sparingly[8].
A drawn model is one special kind of figure that is useful in some more conceptual papers. Sometimes, scientists include cartoon[9] drawings of their systems or hypothetical connections among different components of bigger concepts. These might appear as flowcharts or Venn diagrams. If you can describe your theory using a drawn model, then you might be able to better convey your ideas. If you can’t, do not fret, for such things are not found in most papers, even most great papers.
Tables are like figures, but papers tend to treat them as a separate class of things. Tables can be used to organize materials and methods or to display results. In general, tables are good at displaying relatively small quantities of numerical or textual information that are really boring to try to explain in the text. You can take any table ever created and describe its contents in words only, but that text would be really boring. You should then use tables to organize basic information for the reader’s sake.
How do you make a poster? Most scientists use PowerPoint to make the poster, since it’s easy to move graphics and boxes of text around a slide. If you use PowerPoint, make sure that you set the slide to be the appropriate height and width.
For all of your figures and tables, you will need captions. These little bits of text need to describe how to read their reference and not necessarily interpret what the item says. You will do that in the body of the work. More elaborate figures need legends for your reader to quickly translate colors and symbols.
Posters and Displays
While the public perceives science fair projects with their trifold posters and experimental representations to be the realm of the elementary or high school student, professional scientists more or less do the same thing professionally. We often see bewildered looks on the faces of non-scientists when we describe the quintessential scientific activity of the poster session. We scientists often take our work and sum it up on a 4’×3’ printed poster that we take to a scientific meeting and stand there while other scientists wander by with plates of hors d'oeuvres to consider and discuss our findings. It is really not unlike a science fair. Indeed, the poster or display may be your most important final product for your science fair project, so you should really consider what you are doing at this point.
You might not have this problem, but many scientists struggle with their posters. Few of us have been formally trained in graphic arts, so we first approach our poster as a paper, and that is quite bad. If you walk the halls of a science department a major university, you might encounter some posters that were considered afterthoughts to the all-important papers[10] on which they were based. These posters are difficult to approach, since they are gigantic columns of text spread out on a big piece of glossy paper.
That leads to the first important lesson about the poster that you will no doubt include in your display: base it around your graphics. Your key charts, photos, and tables will draw the eye before your big and bold title will, so they have to be the centerpieces of your poster. They need to be big enough to decipher from four or five feet away with clear captions. While some photographs like pictures of you and your peers laboring away at your study site are not appropriate for papers, they often make great fillers of space and conversation starters when they are appear on posters.
Still, posters need text, and you will have to do some writing. You will need a title that catches the attention of your audience (and especially your judges.) You are more free to be a bit funny with poster titles than you are with paper titles, since you will need to be approachable as you man your display. You will also need to include the names of the authors of the poster, typically right under your title. While the text sections are less formal than one finds in papers, they are usually broken down into introduction material, materials and methods, results and discussion, and conclusions.
In the introduction, you will need to include very brief background material for your project. Limit this to six sentences maximum. You will also need to show a goal or hypothesis that you tested. Again be clear. The materials and methods should be extremely short, listing the kinds of tests you performed and the kinds of materials you used. Results and discussion sections are usually combined into one section to briefly highlight what important things were found and how they related back to the original goals of the experiment. Conclusions are usually limited to four to six sentences.
Figure 1: A poster for a professional scientific conference. Notice the layout that includes clear sections and columns. The photographs catch the eye, and the graph is attractive. The text is broken up into clear bullet points for easy reading. Courtesy of Marie-Eve Jacques and Stephen Hallgren, Oklahoma State University.
You can use bullet points instead of flat text on your poster, since you are only highlighting important information. This method can save you a lot of writing effort, since you needn’t concern yourself with transition phrases and the like. Readers tend to like bullet points, too, since they draw the eye. How many times have you seen paragraphs with text interspersed with bullet points and skipped straight to the bullet points?
Pay attention to fonts and font size. In the world of fonts (or typefaces, as they are more accurately called) there are two classes: serif and sans serif. Serif fonts (like Times New Roman, the font in which this text is written) have little feet adorning them. In general, they are considered to be easier to read in large quantities, as you would find in a book or news article. Sans serif fonts (like Arial or Helvetica) lack these and are considered to be more attractive for things like signs and titles. You can consider using a serif font for the text blocks and bullet points and a sans serif font for the title and headers, but if you do, make sure to play with them until they seem to match. Font size should be large enough that your less youthful instructors and judges can read your sections without overstraining. I would recommend keeping most text about at an 18-point font. Make sure that your font sizes are consistent amongst sections and headers, too.
You need to be especially careful in making the widths and margins of your poster consistent. The easiest way to do this is to pick numbers and use the number fields instead of manually moving them around your poster with your mouse. Text boxes and figures may look okay on your computer screen, but you need to ensure that they are consistently proportioned and positioned, since after they are blown up, those small differences become big differences.
Presentations
Oral presentations are the third way that scientists present information. These days, we speak to groups of people using PowerPoint slideshows, possibly demonstrating aspects of our research with props. You may have stage fright, and you would not be alone. Imagine carefully assembling your results and conclusions for an audience who will stare at you as you talk about them for about twenty minutes. Now, imagine that they will ask critical questions about what you did (and didn’t do.) That’s giving a presentation, and we don’t mean to scare you, but you have to prepare yourself emotionally as well as technically for the exercise.
Your talk will be divided in a similar way as that of a paper, but since the reader can’t skip around or go back as you give your presentation in real time[11], you will have to very deliberately go through certain steps. First, present the basic system you are working on, describing what is known about it and any interesting information you know about it. You can talk about a few things that you wouldn’t mention in a paper, since you have to entertain as well as inform, so now is the time to mention how many cool things your research subjects can do.
After your background information has been covered, you will need to describe what your objectives are and what you hypothesized. This step informs your audience of the scope of your study and what they should look out for as you progress through your story. You should then spend a short amount of time on your materials and methods. Students[12] are frequently tempted to spend far too much time on materials and methods, since they feel more intimately familiar with them. Just talk about how you approached the problem briefly. Only linger on parts that you think your audience might find novel or difficult.
You should focus the bulk of your presentation on your results, because this is the most important part of a talk. You will need to communicate exactly what you found and what your statistical analyses uncovered. Be descriptive and interpret your numbers into real-world terms that your audience can appreciate. Next, summarize what you found and speculate about the meaning of your work. That speculation can bring out later dialog, so don’t be shy about it. You can then move on to suggest what needs to be done next to address what you speculate. Finally, thank your collaborators and helpers and open yourself up to questions.
John’s anecdote: I once made a slideshow for a minor presentation, and the file had to be converted from an OpenOffice format on a PC to a PowerPoint format on a Mac. All of my text disappeared, leaving only my graphics. The presentation went swimmingly, partly because I focused all of my attention on describing the figures.
Typical scientific talks involve accompanying slideshows (almost always made in PowerPoint.) Since we are visual creatures, it is important that your slideshow itself be very good—appealing, not distracting, informative, and supportive of your words. Here is some miscellaneous advice for making good slideshows:
- Minimize text. People don’t read and listen well at the same time. Any text you include should simply outline what you are going to talk about. There are exceptions, of course, but few slideshows are improved through the addition of more text.
- Try to include at least one graphic per slide. If people have something to look at on the screen, then they will do so. It is especially important to include lots of charts and such as you present your results. People like to see what they are hearing.
- Use big fonts. Projectors vary in quality, and people often sit far away from the screen.
- Transitions and animations are nice, but don’t overdo it.
Giving presentations well takes experience, practice, and raw talent. You are young, so you must be forgiven for inexperience, and you cannot help it if you lack raw talent (but bully for you, if you have it.) You can, however, practice. There are several reasons to do so. First, you can find the spots in your presentation in which you are unsure of what to say and fix them. Everyone has these points in their talks in which they are not sure how to transition from one topic to another, and you can practice different approaches. Second, things that often sound good in our heads sound really stupid when we talk. Third, you have to get the timing right. If you are given 20 minutes for a talk, you need to make sure you time it right. Also, you can get a lot of help through practicing with an audience of people who want to watch you succeed like your parents, teachers, and friends[13]. They can give you good feedback to improve your talk.
When you present, you need to do several things to look and sound good. Make sure to wear appropriate clothing and be well-groomed. When you speak, try to suppress the urge to fidget or pace. Tics can be hard to suppress, and everyone has their own, but you can fight them and do a better job. Speak confidently. You may not feel confident, but you have to pretend that you are. Smile. If you force yourself to smile and project positivity, eventually you will internalize that. You may need to modulate your speed, and that is what practice is for.
[1] Frequently, we talk about voice in writing in terms of active and passive. In the active voice, subjects do things. In the passive voice, things are done to the subjects. “Plants are taller with more fertilizer,” is an active sentence. “Addition of fertilizer has resulted in taller plants,” is a passive sentence. Avoid the passive voice. Embrace the active voice. Too many scientists use the passive voice in papers, and that just makes me sleepy. Don’t make me sleepy.
[2] I have proposed a basic approach here, but occasionally, different journals, or perhaps your science fair guidelines, will specify how they want to abstract to pan out. Typically, the only requirement is a maximum word count, anywhere from about 150 to 300 words. Sometimes, the journal will dictate style and form.
[3] John’s note: when I was a younger student, I hated citing literature, and that was probably because I didn’t understand what I was doing. Now, I’m kind of addicted to them.
[4] Et al. is an abbreviation for the Latin phrase “et alii” which means “and others.” You use it when there are more than two authors. A lot of papers have three or more authors. Some papers discussing really large projects like draft genomes or gigantic particle physics accelerators can have dozens of authors.
[5] Like the number referencing this footnote.
[6] Make sure to cite your use of mapping software and images.
[7] Technically, bar charts have horizontal bars, and column charts have vertical bars, but people often refer to column charts as bar charts.
[8] One major reason for this is that journals like to charge for colored figures. They like to charge a lot, and black and white photographs might not be good for what you are showing.
[9] Not silly ones like Garfield, The Far Side, or xkcd… usually.
[10] And yes, peer-reviewed papers are more important, but good posters promote good dialog, and good dialog promotes good science.
[11] This isn’t true of recorded presentations, of course, but even when you give one, you will likely have a live audience. Webinars are relatively recent sorts of presentations. The word “webinar” is a portmanteau of World Wide Web and seminar. Webinars are essentially virtual meetings, and they can often be recorded for future audience members.
[12] This includes graduate students, so don’t take it too personally when we say “students.”
[13] You can use pets as audiences, too. They are pretty non-judgmental, though they can be rather insulting when they just curl up and take a nap during when you reach a really cool figure.
Glossary of Terms
Abstract – In a paper, a short summary of the full paper.
Alternative hypothesis – In statistics, the hypothesis that one mean is larger than, smaller than, or unequal to another mean with statistical significance.
Analysis of variance (ANOVA) – In statistics, a way to test multiple means against each other for statistical significance.
Bar / column chart – A chart with classes of data and their magnitudes.
Binomial data – In statistics, data that is an either/or proposition, for example, 0 or 1, or yes or no.
Chi-square (Χ2) test – In statistics, a test to determine whether quantities of different classes match expected quantities
.
Confidence interval – In statistics, the upper and lower bounds of the estimation of the population mean, given the sample data.
Controlled experiment – An experiment in which one sample is set up as a control, and other samples are set up to vary one or more independent variables.
Degrees of freedom – In statistics, the number of values in the final calculation of statistic that are free to vary.
Dependent variable – The variable that is measured as the result of the experimental effect.
Discussion – In a paper, the section that is used to describe the impact of results and speculate about their meaning.
Error bar – On a graph, the upper and lower bounds of a confidence interval.
Experiment – A systematic approach designed to test a hypothesis.
Experimental bias – An undesired effect leftover from accidents in the experiment.
Experimental sample / treatment sample – In an experiment, the sample(s) that has (have) variables different from the control sample.
Figure – In a paper, poster, or presentation, a chart, photograph, drawing, map, etc.
Histogram – A special kind of bar / column chart that compares the frequencies of ranges of a variable.
Hypothesis – A testable idea that could explain an observed phenomenon.
Hypothesis test – In statistics, a test to determine whether two estimated means are different from each other with statistical significance.
Independent variable – A variable that is altered by the experimenter or otherwise varied across treatments.
Introduction – In a paper, the section that describes the system and introduces the problem addressed.
Literature citation – In a paper, a callout to previously produced work used to support a point.
Margin of error – In statistics, the value added to and subtracted from the sample proportion to generate the upper and lower bounds of the confidence interval.
Materials and methods – In a paper, the section that describes sources for materials and the parameters of the experiments.
Mean (arithmetic) – In statistics, the sum of all variable values for individuals in a sample divided by the number of individuals in the sample.
Median – In statistics, the value of a variable for one individual separating the upper half of the distribution from the lower half of the distribution.
Mode – In statistics, the most common value for a variable in a sample.
Natural experiment – An experiment comparing existing natural samples using natural variation to determine independent and dependent variables.
Noise – Random variation in a value.
Null hypothesis – In statistics, the hypothesis that two sample means are not different from each other with statistical significance.
Outlier – A value for an individual’s variable that is unusually much larger or smaller than the mean or otherwise appears to be non-normal.
p-value – In statistics, the probability of getting a test statistic result at least as extreme as what was observed, assuming that the null hypothesis is true.
Pearson coefficient – A measure of the linear correlation between two variables for a sample.
Pilot experiment – A miniature experiment used to test the viability of and potential pitfalls for a real experiment.
Plagiarism – Unattributed use of another’s text or other materials.
Probability of success – In a binomial variable for a population, the likelihood that an individual will have a particular value.
Results – In a paper, the section that describes what the experimenter(s) found.
Sample – A group of individuals that were all exposed to the same treatment.
Sample size – The number of individuals in a sample.
Scatter plot – A chart in which individuals are organized for two variables, one of which varies on the x-axis, and the other of which varies on the y-axis.
Sample proportion – In statistics, the proportion of individuals that have a certain value for a binomial variable.
Sample Variance – In statistics, a measure of the variation in a variable across the sample. In square units of the original variable.
Signal – The non-random variation in values that conveys useful information.
Standard deviation – In statistics, a measure of the variation in a variable across the population. In the same units of the original variable.
Standard error - In statistics, a measure of the variation in a variable across the sample. In the same units of the original variable.
Statistical significance – In statistics, having a low probability of randomly reproducing at least as extreme a result as found within the data.
System – The organism, mineral, interaction, etc., that you are working on in your research.
Treatment – A set of conditions under which an experiment is being performed.
Variation – The difference or divergence within or among samples.
Guide for Using Excel for Statistics and Charts
Guide for Using Excel for Statistics and Charts
This is a quick guide for using Excel (2007 and later) to solve statistics problems and create charts for you science fair project. This guide is intended to be a starting point for you, since every project is different. This guide also assumes that you have access to the document “Description of the Scientific Process.”
Basic Terms and Concepts
Figure 1: Cells in Excel. The active cell here is A1.
To start with, you will need to understand some basic terms and concepts about Excel. If you are totally unfamiliar with the software, then this guide might be too advanced for you, but there are many great guides available online for this ubiquitous program. Excel is a spreadsheet[1] program. As such, each open sheet has a grid of cells with alphabetic column headers and numeric row headers. Cells can be identified by their coordinates. The cell A1 is the top left corner cell; A2 is its neighbor to the right; and B1 is its neighbor below (Figure 1). You can fill cells with numbers, text, and formulas.
Figure 2: Sheet selector.
By default, a new Excel spreadsheet will have three sheets. You can remove, add, or rename these sheets, and they can reference each other (Figure 2).
When you enter your data, make sure you have a plan. In general, have a column for individual identification labels and columns for measurements, dates, and other notations that you make. If things look messy, you may want to move some things over to another sheet. As you work, familiarize yourself with the copy/cut/paste functionality, as well as insertion and deletion of columns and rows.
Excel functions are special formulas that you can input in order to make calculations or a few other specialized activities. Anytime you begin filling a cell with the equal sign (“=”), you are telling Excel that you want it to do something. Fill the cells A1 and A2 with numbers. Now type “=A1+A2” into any other cell. You will see the sum of those two numbers. “A1-A2” yields the difference; “A1*A2” reveals the product; and “A1/A2” calculates the division of the value in A1 by the value in A2. You may use parentheses to make more complicated functions.
Figure 3: The Function Library.
Excel provides a whole host of formulas that you can use to perform all sorts of calculations and other manipulations. In general, each formula is written as =FORMULA(argument1,argument2,argument3,…). Each of those arguments is specific to the formula you are using, and Excel will help you find out what you need in those fields. You can find the full list of formulas in the Function Library on the Formula Ribbon (Figure 3).
Frequently, you will use the notation “Column1Row1:Column2Row2.” You can use this to select multiple cells for a given for a given function, something that is necessary for many of them. As an example, “A1:B2” selects the four cells A1, A2, B1, and B2 in what is called a table. Sometimes, we will encourage you to insert dollar signs (“$”) before column letters or row numbers. When you copy and paste cells that contain these symbols, those particular columns and rows will remain unchanged in the cells in which you pasted. Play around with some of these things. If you’ve got a blank spreadsheet, you really won’t hurt anything.
Simple Statistical Functions
Let’s start with the simplest functions regarding averages and variability.
- You may add many cells using the function =SUM(number1,number2,). Those numbers, may of course, be cells or fields. For example, =SUM(A1:A10) adds all of the numbers in column A, rows 1 through 10.
- You may count values using the function =COUNT(number1,number2,). It will count all numbers listed.You may compute the mean using the function =AVERAGE(number1,number2,).You may compute the median using the function =MEDIAN(number1,number2,).
  - There are useful variations on this function called =COUNTIF and =COUNTIFS which you can use to count only specific kinds of numbers. Look them up, if you think you need them.
- You can find the mode using the function =MODE.SNGL(number1,number2,).
- To find the variance of the population[2], use the function =VAR.P(number1,number2,).
- To find the sample variance, then use the function =VAR.S(number1,number2,).
- To find the standard deviation of the population, use the function =STDEV.P(number1,number2,).
- To find the sample standard deviation, use the function =STDEV.S(number1,number2,).
- In order to calculate the sample proportion, use =COUNTIF(range,criteria)/COUNT(number1,number2,).Calculate square roots with SQRT(number).
  - Range must be a table, i.e. A1:A20.
  - Criteria is the condition in which Excel will count your data. If you scored hits as 1’s and misses as 0’s, then your formula should be =COUNTIF(A1:A20,”1”)/COUNT(A1:A20).
- You can create a 95% confidence interval in you spreadsheet. You calculate the standard error () with the function =
  - You may then subtract and add twice the resulting standard error from the mean to get the confidence interval.
  - If you want to use a 98% confidence interval, change the first term from 0.05 to 0.02.
- You can calculate the margin of error at 95% confidence with =1.96*SQRT(((COUNTIF(range,criteria)/COUNT(number1,number2,))*(1-COUNTIF(range,criteria)/COUNT(number1,number2,)))/COUNT(number1,number2)
  - You may then subtract and add the resulting margin of error from the sample proportion to generate a 95% confidence interval.
  - If you want to use a 98% confidence interval, substitute 2.33 for 1.96.
- To perform a hypothesis test on a set of data against a known constant, use =T.DIST.2T(ABS(AVERAGE(number1,number2,)-hypothesis)/(STDEV.S(number1,number2,)/SQRT(COUNT(number1,number2))),COUNT(number1,number2)-1)
  - This is the test for the hypothesis that.
  - The function =ABS(number) generates the absolute value of that number.
  - If you want to check against the hypothesis , substitute T.DIST for T.DIST.2T and add ,FALSE immediately before the final ).
  - If you want to check against the hypothesis , substitute T.DIST.RT for T.DIST.2T.
  - Each of these tests returns the p-value.
- To perform a Student’s t-test with two continuous datasets, use the function =T.TEST(array1,array2,tails,type).
  - Array1 and array2 are the ranges of the two sets of data you are comparing.
  - Tails = 1 if the test is one-tailed. Tails = 2, if the test is two-tailed.
  - Type refers to the parameters of your test. Type = 1, if the data are paired. Type = 2, if the samples have equal variance. Type = 3, if the samples have unequal variance.
  - The output will be a p-value.
- To perform a z-test on binomial data against a hypothesis, use the function =NORM.DIST((AVERAGE(array1)-hypothesis)/SQRT((hypothesis*(1-hypothesis))/COUNT(range1)),0,1,TRUE)
  - This tests the hypothesis that .
  - It returns a p-value.
- To perform a z-test on binomial data against a hypothesis, first use the function =AVERAGE(range1)*COUNT(range1)/(COUNT(range1)+COUNT(range2))+AVERAGE(range2)*COUNT(range2)/(COUNT(range1)+COUNT(range2)). This is the (p-hat the proportion of successes for the combined samples.) In another cell, insert the function =NORM.S.DIST((AVERAGE(range1)-AVERAGE(range2))/SQRT(p-hat*(1-p-hat)*(1/COUNT(range1)+1/COUNT(range2))),FALSE).
  - This test returns a p-value.
- To generate a Pearson coefficient (), use the function =PEARSON(array1,array2).To generate a p-value from a Pearson coefficient, use the function =T.DIST.2T(PEARSON(array1,array2)*SQRT((COUNT(array1)-2))/(1-PEARSON(array1,array2)^2)),COUNT(array1)-2).
  - Both arrays have to have the same number of cells, since all of the values should be paired for individuals.
  - You can add ^2 to the end to get the value .
There are, of course, other tests that you may want to perform. Some of those are explicitly listed in the Excel Function or are Data Analysis options. You can also look at the abundant examples of worked problems available on the internet.
Making Charts
Figure 4: Chart selector in Excel.
In general, chart making in Excel is easy, but it can be hard to get things just right. Once you have some data in your spreadsheet, go to the Insert ribbon and go to the Charts area (Figure 4). There, you will find a variety of charts to choose from. You can select your data area before you click the chart type that you want, and Excel will try to interpret what you want to do. Sometimes it’s right, and sometimes it’s wrong, so you will need to check what it is doing. If you choose not to select your data first, then Excel will produce a blank chart. You can select that and begin working.
Figure 5: Chart Tools tabs.
Once selected, three new Chart Tools tabs will appear in the ribbon: Design, Layout, and Format[3] (Figure 5). Each one contains several tools that you can use to specify what your chart is about and to change how the chart looks. We are not going over everything you can do, but you should know the very basics.
Figure 6: Select Data Source window.
The most important button in the Design tab is the Select Data button. That allows you to tell Excel exactly what data you’re using for the chart. Once you have clicked it, you will be able to add and edit data series (Figure 6). Data series are related data points (i.e., the pH of each individual in your control sample.) If you click the Add or Edit button, you can select the data for the series, as well as name the series.
The Layout tab offers up numerous important options for customizing your chart. You can use the buttons in this tab to add axis labels, a chart title, a legend, and various other options particular to the kind of chart you are working with. We are not going into depth here, and you should experiment. If it looks like you should be about to do something, then you probably can, and we recommend searching the internet for help on any particular thing.
Figure 7: The default Excel color scheme.
The Format tab is mostly about the aesthetics of your chart. You can pick colors, fonts, and other visual modifications in this tab. If you are using the chart in a slideshow, paper, or poster, then you need to make it look special. Excel starts with a particular color scheme that is pleasant (Figure 7), but everyone alread knows it, and they will see that you put no effort into making your figures your own. If color will cost you extra money, then figure out how to use grayscale with your data.
Generating Random Patterns with Excel
Often in science, it’s important to randomize your experimental design. You may need to measure things in a random order or lay them out in a grid without bias introduced from a predictable pattern. There are different ways to do this, and Excel is a readily available tool. Start by listing your individuals, treatments, or whatever else it is that you need to randomize down one column. In a column next to that one, type in the function =RAND(). That will generate a random number between 0 and 1. Select the list you created and the random numbers. In the Home tab, select the Sort and Filter pull-down menu and click Custom. Click Sort By and pick the column with random numbers. Click OK, and your list is randomized. You can delete the random numbers as well. If you are keeping other columns of data or other associated information with the first column, make sure to select them before you sort, or they will stay in their original cells instead of maintaining their association.
[1] We don’t thank accountants enough for things, and the spreadsheet is biggest contribution to society from the world of accounting. Without it, so many useful tasks would be much harder, and early business demand for computers would have been much less, putting us years behind our current computing technology. So, thank you, accountants. You quietly make the world a better place.
[2] Remember the difference between the variance of the population and the sample variance. If you are looking at your data, and it is only a representative sample of the whole population, then should use the sample variance.
[3] I can’t say that Microsoft made good decisions in naming these tabs. They’re all basically synonyms, but you have to work with what you’re given.
DESCRIPTION OF THE SCIENTIFC PROCESS ADOBE ACROBAT FILE (PDF)
Link
Guide for Using Excel for Statistics and Charts Adobe Acrobat File (.pdf)
Link
Statistical Examples in an Excel File
Link