The value of clean data


In my line of research we have fancy algorithms for removing outside contamination from the data we collect. The problem with collecting electrophysiological data (electrical recordings from a person) is that there is so much damned noise everywhere. The problem is magnified when you collect data that have a low signal-to-noise ratio (meaning lots of noise and not much signal). Signal in this case is the thing we’re interested in measuring, and while we have dozens of algorithms to filter (remove) the noise, there’s still no substitute for data that were well collected.