Exercises in Comprehensive Information Gathering

蛋挞报

Author: johnswentworth

February 15, 2020


Looking back, several of the most durably-valuable exercises I’ve done over the years have a general theme of comprehensive information gathering.

The most recent example involves capital investments. Economists talk about “capital goods” as physical stuff - machines, buildings, etc. But in practice, savings and investments are passed through banks and ETFs, bundled and securitized, involve debts and shares of companies which own debts and shares of other companies, and so forth… where does all that capital end up? To get an intuitive sense, I pulled up fundamental data on about 7000 US publicly-traded companies in Quantopian, sorted them by amount of non-financial assets, and found that the top 100 accounted for about 50% of the non-financial assets of the whole set. Then, I looked at annual reports for each of those 100 companies, to see what capital assets they had. I googled around for pictures and maps of where those assets were located, and read up on anything I hadn’t heard of before. What’s a “central office”, where are they, what do they look like, and why does AT&T have $90B worth of them? What are the major US oil basins, where are the wells, and what all goes into drilling them? What are the technical differences between traditional phone, cable, satellite, and cell networks, and how do those technical differences impact the capital requirements of each? Who runs power plants and the power grid in various parts of the country? What are the major US railroads, and where are they? Why did GE own so many airplanes? These are the kinds of questions which come up when you want to know what “capital goods” actually consist of, in the real world.
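As a rough illustration of the concentration check described above (sort companies by non-financial assets, then see what share the largest ones hold), here is a minimal Python sketch. It is not the original Quantopian analysis: the function names `top_n_share` and `companies_needed` are hypothetical, and the company names and dollar figures are made up purely for illustration.

```python
def top_n_share(assets_by_company, n):
    """Return the fraction of total assets held by the n largest companies."""
    values = sorted(assets_by_company.values(), reverse=True)
    total = sum(values)
    if total <= 0:
        raise ValueError("total assets must be positive")
    return sum(values[:n]) / total


def companies_needed(assets_by_company, target_share):
    """Return the smallest number of top companies whose combined
    share of total assets reaches target_share (e.g. 0.5 for 50%)."""
    values = sorted(assets_by_company.values(), reverse=True)
    total = sum(values)
    if total <= 0:
        raise ValueError("total assets must be positive")
    running = 0.0
    for count, value in enumerate(values, start=1):
        running += value
        if running / total >= target_share:
            return count
    return len(values)


# Made-up non-financial assets (in $B) for hypothetical companies A-F.
example_assets = {"A": 400.0, "B": 300.0, "C": 100.0,
                  "D": 100.0, "E": 50.0, "F": 50.0}

print(top_n_share(example_assets, 2))         # 0.7
print(companies_needed(example_assets, 0.5))  # 2
```

With real fundamental data in place of the toy dictionary, `companies_needed(data, 0.5)` gives the size of the reading list: the number of annual reports that together cover half of the assets in the set.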

Another interesting exercise: I read through five years of Nature archives, reading all the titles and any abstracts which sounded novel/interesting. I didn’t google everything I hadn’t heard of; instead, I’d wait until the same acronym popped up a few times before looking it up. This took maybe a week of evenings after work. By the end, I could at least place the large majority of articles in context. Now, when I see a title full of jargon in a field I haven’t studied, like “Novel tau filament fold in corticobasal degeneration”, I usually at least understand enough to guess at what it’s relevant to (in this case: neurodegenerative disease involving protein aggregates, probably Alzheimer’s?). I can generally follow conversations in a bunch of different fields - not necessarily between specialists in the same sub-sub-field, but at least at the level of a typical conference talk, and when I meet new people I can ask not-too-embarrassing questions about what they’re researching.
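The "wait until the same acronym pops up a few times" rule can be sketched in a few lines of Python, for anyone who wants to apply it to a list of titles they have already collected. This is only an illustration under stated assumptions: the function name `acronyms_worth_looking_up` is hypothetical, "a few times" is arbitrarily taken to mean three titles, an acronym is crudely defined as a run of two or more capital letters or digits starting with a letter, and the sample titles are made up.

```python
import re
from collections import Counter

# Crude assumption: an acronym is a standalone token of 2+ capitals/digits.
ACRONYM = re.compile(r"\b[A-Z][A-Z0-9]+\b")


def acronyms_worth_looking_up(titles, min_count=3):
    """Return acronyms that appear in at least min_count different titles."""
    counts = Counter()
    for title in titles:
        # Use a set so each acronym counts once per title.
        counts.update(set(ACRONYM.findall(title)))
    return sorted(a for a, c in counts.items() if c >= min_count)


# Made-up titles, for illustration only.
sample_titles = [
    "CRISPR screen in mice",
    "A CRISPR toolkit",
    "CRISPR and RNA editing",
    "RNA folding",
]

print(acronyms_worth_looking_up(sample_titles))  # ['CRISPR']
```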

Going back further, if you’re in college, I strongly recommend reading your entire course catalogue, googling anything you’ve never heard of at all, and marking anything that sounds potentially interesting. This seems really obvious; it only takes a few hours, and something something a pile of value sitting on a silver platter right in front of you. (Note: I went to a small STEM school; if you’re at a big school with a bajillion courses or a school with poor STEM coverage or not at college at all, consider reading an MIT/Caltech course catalogue instead, to get a feel for what all is out there.) You never know what surprising and interesting topics might be hiding in there - microfluidics, underactuated robotics, recursive macroeconomics, systems biology, synthetic biology, origami algorithms, computational photography, evo-devo, procedural graphics, and on and on.

These sorts of exercises provide value in a few ways:

  • They reveal unknown unknowns - things you didn’t even realize were missing from your picture of the world.
  • You can’t make a map of a city by sitting in your room with the shades drawn; exercises like these force you to look at large slices of the world.
  • Knowledge within fields tends to have decreasing marginal returns - your first physics or CS class will teach you much more than your eighth. These exercises give a broad, brief glance at many areas where you probably haven’t reached decreasing marginal returns yet.
  • You can get a very rough big-picture sense of how much effort other people are investing in various areas - e.g. where most capital investments go or where most research effort goes - which is useful for understanding the world in general.
  • While these exercises don’t avoid biased selection of information altogether, they’re probably different biases from what you run into naturally, and they’re systematic enough that we can guess at what biases are likely to be present.
  • They’re a lot of fun, if you have a curious streak.

Most importantly: I’ve found each of these exercises to have lasting, long-term value in exchange for a one-time investment of effort.

Other exercises which are on my to-do list, but which I haven’t done yet:

I’m curious to hear other suggestions for exercises along these lines.



【by lionhearted】

I've done similarly. It's actually remarkable how little time it takes to overview the history of breakthroughs in a sub-field, or all the political and military leaders of an obscure country during a particular era, or the history of laws and regulations of a particular field.

Question to muse over: given how inexpensive and useful it is to do this, why do so few people do it?

【by johnswentworth】

>Given how inexpensive and useful it is to do this, why do so few people do it?

I actually considered putting a paragraph on this in the OP. I think we're currently in a transitional state - prior to the internet, it would have been far more expensive to conduct this sort of exercise. People haven't had much time to figure out how to get lots of value out of the internet, and this is one example which I expect will become more popular over time.

【by lionhearted】

Makes sense. This is probably worth a top level post? —

>People haven't had much time to figure out how to get lots of value out of the internet, and this is one example which I expect will become more popular over time.

Sounds obvious when put like that, but I think — as you implied — a lot of people haven't thought about it yet.

【by Raemon】

Curated.

I like the idea of posts that suggest concrete exercises, and I think the sort of project John is pointing at here is something I hope LessWrong folk will do more often.

I also think it lends itself well as a self-reinforcing concept on LessWrong in particular. Lots of rationality exercises you might just do quietly by yourself, but the sort of review John suggests here seems like it'd often lead to good new blogposts that'd be useful for others to learn from, as well as reminding people about the possibility of doing this exercise for themselves. (Although obviously if you just end up doing it for yourself, that's quite valuable as well.)

【by Viliam】

>Given how inexpensive and useful it is to do this, why do so few people do it?

Because there are so many possible topics that, even if each of them takes relatively little time, together they would take a lot?

For example, in your example, you mentioned "an obscure country" and "a particular era", and also a focus on politics and military (as opposed to science, or art, or sport). Okay, maybe you can do it in a week, or in an afternoon. But why that country, and why that era? How much would it cost to get a comparable knowledge of all countries and, uhm, let's say the entire 20th century?

【by lionhearted】

Ahh, great question.

I think patterns eventually start to emerge, so you start reading about the federalization of Chinese law and you think, "ah, this is like German Unification with a few key differences."

While you do find rare outliers (the Ottoman legal system continues to fascinate me: https://en.wikipedia.org/wiki/Millet_(Ottoman_Empire) ), you eventually find that there are only a few major ways that legal systems have been formulated at the larger scale of modern countries, as opposed to earlier local scales.

Science, art, and sport are also ones I've delved into incidentally. And there are also some patterns there.



Source: LessWrong

https://www.lesswrong.com/posts/9LXxgXySTFsnookkw/exercises-in-comprehensive-information-gathering

