{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# PhD Guide Project\n", "\n", "*Spring project for the [Data Science](http://math-info.hse.ru/s18/y) course, Joint HSE-NES Bachelor's Program, 2018-19 academic year.*\n", "\n", "*Author: Анна Щеткина.*\n", "\n", "Welcome! This project consists of several large tables related to PhD admissions. Part 1 contains information about the universities, Part 2 covers living conditions in the cities where they are located, and Part 3 lets you learn GRE vocabulary with helper cats. \n", "\n", "Unfortunately, this code runs on Mercury retrograde, so there is a chance it will break somewhere. The reason is that we scrape several different sites, which are sometimes down, sometimes try to block scraping, and sometimes do something else entirely. Because of the web scraping, the code not only fails to run on occasion but also takes a long time. For your convenience, all the main results are already in the notebook, so you can see what the finished dataframes and plots look like. Part 3 (the most interactive one) can also be run on its own if you want to see the cats yourself.\n", "\n", "Next comes a description of all the technologies by grading criterion, followed by the project itself with comments." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Project description\n", "\n", "1. Data processing with pandas. Since the whole project is about building three tables, it is only natural that we used advanced pandas features. We merged tables, applied many different functions to columns, normalized values, worked with strings in the tables, sorted, renamed, and even used pivot when computing average research expenditure.\n", "\n", "2. Web scraping. Again, we scraped a lot of things, which required many different methods. We used Selenium twice, and not just to open a page. 
With it we scrolled the page down to load all the information we needed, typed a query into a form, pressed Enter, found a link by its text, followed the link, and went back. We also used BeautifulSoup extensively, in almost every cell.\n", "\n", "3. Working with a REST API (XML/JSON). A substantial part of the data came from the DataUSA API (as JSON). We first had to use the API's attributes to find the local IDs of the universities and cities we were interested in, then apply advanced filtering and download all the data into a table, which we then combined with ours using merge (but that is item 1 again). The crowning example of non-trivial API use in this project is, of course, the random cats for vocabulary learning (also obtained with filtering, as the url shows).\n", "\n", "4. Data visualization. For visualization we used seaborn plots: barplot and heatmap. We had to tweak the data a little (normalize and sort it) to make the plots look good, and tune the plot parameters for clarity.\n", "\n", "5. Mathematical capabilities of Python (meaningful use of numpy/scipy, SymPy, etc. to solve mathematical problems). We did manage to find mathematical problems in the universities table: normalization, and computing weighted sums for the indices with the numpy dot product.\n", "\n", "6. Other technologies (not yet covered in the course but related to data processing with Python, R, or other programming languages), for example Telegram bots, machine learning methods, regular expressions, or something else. We used Styler to make the links in the table clickable, datetime to process dates and count days, and image-handling tools so that a picture is displayed from its url directly in Jupyter, inside a loop.\n", "\n", "7. Size (meaningful lines of code). 
There are close to two hundred, definitely more than 75.\n", "\n", "8. Overall impression. I hope you liked it :)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Part 1. Universities\n", "\n", "Here we download the US News list of the top 30 economics PhD programs and, using Wikipedia, the HSE website, and the Data USA API, add a few interesting facts: a link to the corresponding economics department, the number of Nobel laureates, our alumni who were admitted to these universities, and how much the universities spend on research." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Import everything we will need" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import numpy as np\n", "import requests\n", "from bs4 import BeautifulSoup\n", "from selenium import webdriver\n", "import time\n", "import seaborn as sns\n", "from matplotlib import pyplot as plt\n", "%matplotlib inline " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "This project uses Selenium with Chrome. If you do not have it, please change it to your own Selenium driver in the next cell. Nothing needs to be changed anywhere else.\n", "\n", "Now we will use Selenium to scrape the top 30 universities from US News (the page cannot be scraped from Python without it)" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [], "source": [ "list_univs = []\n", "browser = webdriver.Chrome()\n", "ref = 'https://www.usnews.com/best-graduate-schools/top-humanities-schools/economics-rankings'\n", "browser.get(ref)\n", "time.sleep(2)\n", "# on this page the full table is available only after scrolling down (like VK). 
\n", "# Scroll down a bit: 20 universities is still too optimistic\n", "browser.execute_script(\"window.scrollTo(0, 4000)\")\n", "time.sleep(3)\n", "# name the parser explicitly so the result does not depend on which parsers are installed\n", "soup = BeautifulSoup(browser.page_source, 'html.parser')\n", "table = soup.find('table', class_=\"TableStacked__Container-u0w2tu-0 geKOYi\")\n", "for univ in table.find_all('div', class_=\"Box-s85n6m5-0 kKdFhD\")[:30]:\n", " list_univs.append([univ[\"name\"], univ.p.text.split(', ')[0], univ.p.text.split(', ')[1], univ.p.text])\n", "df = pd.DataFrame(list_univs, columns = ['University', 'Location', 'State', 'Full Location'])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "In the next cell you can see what the table looks like at this point. It should contain 30 universities with the city, the state as a two-letter postal code, and a combined city + state column." ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "| | University | Location | State | Full Location |\n",
"|---|---|---|---|---|\n",
"| 0 | Harvard University | Cambridge | MA | Cambridge, MA |\n",
"| 1 | Massachusetts Institute of Technology | Cambridge | MA | Cambridge, MA |\n",
"| 2 | Princeton University | Princeton | NJ | Princeton , NJ |\n",
"| 3 | Stanford University | Stanford | CA | Stanford, CA |\n",
"| 4 | University of California--Berkeley | Berkeley | CA | Berkeley, CA |\n",
"| 5 | Yale University | New Haven | CT | New Haven, CT |\n",
"| 6 | Northwestern University | Evanston | IL | Evanston, IL |\n",
"| 7 | University of Chicago | Chicago | IL | Chicago, IL |\n",
"| 8 | Columbia University | New York | NY | New York, NY |\n",
"| 9 | University of Pennsylvania | Philadelphia | PA | Philadelphia, PA |\n",
"| 10 | New York University | New York | NY | New York, NY |\n",
"| 11 | University of California--Los Angeles | Los Angeles | CA | Los Angeles, CA |\n",
"| 12 | University of California--San Diego | La Jolla | CA | La Jolla, CA |\n",
"| 13 | University of Michigan--Ann Arbor | Ann Arbor | MI | Ann Arbor, MI |\n",
"| 14 | University of Wisconsin--Madison | Madison | WI | Madison, WI |\n",
"| 15 | Cornell University | Ithaca | NY | Ithaca, NY |\n",
"| 16 | Duke University | Durham | NC | Durham, NC |\n",
"| 17 | University of Minnesota--Twin Cities | Minneapolis | MN | Minneapolis, MN |\n",
"| 18 | Brown University | Providence | RI | Providence, RI |\n",
"| 19 | Carnegie Mellon University | Pittsburgh | PA | Pittsburgh, PA |\n",
"| 20 | University of Maryland--College Park | College Park | MD | College Park, MD |\n",
"| 21 | University of Rochester | Rochester | NY | Rochester, NY |\n",
"| 22 | Boston University | Boston | MA | Boston, MA |\n",
"| 23 | Johns Hopkins University | Baltimore | MD | Baltimore, MD |\n",
"| 24 | Boston College | Chestnut Hill | MA | Chestnut Hill, MA |\n",
"| 25 | Pennsylvania State University--University Park | University Park | PA | University Park, PA |\n",
"| 26 | University of Texas--Austin | Austin | TX | Austin, TX |\n",
"| 27 | Washington University in St. Louis | St. Louis | MO | St. Louis , MO |\n",
"| 28 | Michigan State University | East Lansing | MI | East Lansing, MI |\n",
"| 29 | Ohio State University | Columbus | OH | Columbus, OH |\n", "
\n", "| | University | Location | State | Full Location | Econ Department | Alumni | #Nobel Prizes | University ID | Average Research Expenditure |\n",
"|---|---|---|---|---|---|---|---|---|---|\n",
"| 0 | Harvard University | Cambridge | Massachusetts (Boston) | Cambridge, MA | http://www.economics.harvard.edu/graduate | Мария Воронина, Роман Сигалов | 8 | 166027 | 8.119160e+08 |\n",
"| 1 | Massachusetts Institute of Technology | Cambridge | Massachusetts (Boston) | Cambridge, MA | http://econ-www.mit.edu/graduate/ | You might be the first guy from HSE here! | 10 | 166683 | 1.304430e+09 |\n",
"| 2 | Princeton University | Princeton | New Jersey (Trenton) | Princeton , NJ | http://www.princeton.edu/economics/graduate/ | Анна Белоног, Никита Мельников | 6 | 186131 | 3.017482e+08 |\n",
"| 3 | Stanford University | Stanford | California (Sacramento) | Stanford, CA | http://economics.stanford.edu/graduate | Алина Арефьева, Иван Королев, Алина Арефьева, ... | 3 | 243744 | 1.076905e+09 |\n",
"| 4 | University of California, Berkeley | Berkeley | California (Sacramento) | Berkeley, CA | http://emlab.berkeley.edu/econ/grad/grad.shtml | Дарья Бахарева, Петр Мартынов, Сергей Стеблёв | 2 | 110635 | 6.475572e+08 |\n",
"| 5 | Yale University | New Haven | Connecticut (Hartford) | New Haven, CT | http://www.econ.yale.edu/graduate/index.htm | Иван Кленовский | 6 | 130794 | 4.926600e+08 |\n",
"| 6 | Northwestern University | Evanston | Illinois (Springfield) | Evanston, IL | http://www.econ.northwestern.edu/phd/index.html | Егор Козлов, Алексей Макарьин, Дмитрий Седов | 2 | 147767 | 4.291662e+08 |\n",
"| 7 | University of Chicago | Chicago | Illinois (Springfield) | Chicago, IL | http://economics.uchicago.edu/graduate.shtml | Юлия Жесткова | 14 | 144050 | 3.285612e+08 |\n",
"| 8 | Columbia University | New York | New York (Albany) | New York, NY | http://econ.columbia.edu/graduate-program | Тимур Аббясов, Анастасия Гоноцкая | 4 | 190150 | 6.533390e+08 |\n",
"| 9 | University of Pennsylvania | Philadelphia | Pennsylvania (Harrisburg) | Philadelphia, PA | http://crim.sas.upenn.edu/ | Эдвард Бахитов, Александр Беляков, Мария Гельр... | 2 | 215062 | 7.205335e+08 |\n",
"| 10 | New York University | New York | New York (Albany) | New York, NY | http://econ.as.nyu.edu/page/home | Александра Алферова, Анна Денисенко, Кристина ... | 1 | 193900 | 7.725442e+08 |\n",
"| 11 | University of California, Los Angeles | Los Angeles | California (Sacramento) | Los Angeles, CA | http://www.econ.ucla.edu/graduate/ | Степан Алексенко, Иван Лавров, Евгения Назрулл... | 1 | 110662 | 8.231362e+08 |\n",
"| 12 | University of California, San Diego | La Jolla | California (Sacramento) | La Jolla, CA | http://economics.ucsd.edu/grad/index.php | Александр Левкун, Анастасия Файкина | 2 | 110680 | 8.574648e+08 |\n",
"| 13 | University of Michigan, Ann Arbor | Ann Arbor | Michigan (Lansing) | Ann Arbor, MI | http://www.lsa.umich.edu/econ/graduatestudy | You might be the first guy from HSE here! | 0 | 170976 | 8.218498e+08 |\n",
"| 14 | University of Wisconsin, Madison | Madison | Wisconsin (Madison) | Madison, WI | http://www.ssc.wisc.edu/econ/grad | Алина Арефьева, Рената Гайнедденова, Андрей Зу... | 0 | 240444 | 9.069298e+08 |\n",
"| 15 | Cornell University | Ithaca | New York (Albany) | Ithaca, NY | http://www.economics.cornell.edu/graduate/grad... | You might be the first guy from HSE here! | 1 | 190415 | 3.624760e+08 |\n",
"| 16 | Duke University | Durham | North Carolina (Raleigh) | Durham, NC | http://econ.duke.edu/phd-program/degree-reqs | Илья Козис, Екатерина Рощина | 0 | 198419 | 8.628472e+08 |\n",
"| 17 | University of Minnesota, Twin Cities | Minneapolis | Minnesota (St. Paul) | Minneapolis, MN | http://www.econ.umn.edu/graduate/index.html | Константин Голяев, Константин Голяев, Егор Мал... | 0 | 174066 | 7.794002e+08 |\n",
"| 18 | Brown University | Providence | Rhode Island (Providence) | Providence, RI | http://www.brown.edu/Departments/Economics/gra... | Сергей Панкратьев, Вячеслав Савицкий, Александ... | 0 | 217156 | 1.135165e+08 |\n",
"| 19 | Carnegie Mellon University | Pittsburgh | Pennsylvania (Harrisburg) | Pittsburgh, PA | http://www.tepper.cmu.edu/doctoral-program/fie... | You might be the first guy from HSE here! | 4 | 211440 | 2.198390e+08 |\n",
"| 20 | University of Maryland, College Park | College Park | Maryland (Annapolis) | College Park, MD | https://ccjs.umd.edu/ | You might be the first guy from HSE here! | 0 | 163286 | 4.436358e+08 |\n",
"| 21 | University of Rochester | Rochester | New York (Albany) | Rochester, NY | http://www.econ.rochester.edu/graduate/ | You might be the first guy from HSE here! | 0 | 195030 | 3.005762e+08 |\n",
"| 22 | Boston University | Boston | Massachusetts (Boston) | Boston, MA | http://www.bu.edu/econ/gradprgms/ | You might be the first guy from HSE here! | 0 | 164988 | 1.819760e+08 |\n",
"| 23 | Johns Hopkins University | Baltimore | Maryland (Annapolis) | Baltimore, MD | http://www.econ.jhu.edu/grad-prog.html | You might be the first guy from HSE here! | 0 | 162928 | 1.344580e+09 |\n",
"| 24 | Boston College | Chestnut Hill | Massachusetts (Boston) | Chestnut Hill, MA | http://fmwww.bc.edu/ec/grad.php | You might be the first guy from HSE here! | 0 | 164924 | 3.629858e+07 |\n",
"| 25 | Pennsylvania State University, University Park | University Park | Pennsylvania (Harrisburg) | University Park, PA | http://sociology.la.psu.edu/graduate/programs/... | Михаил Заварзин, Олег Муратов | 0 | 214777 | 7.703808e+08 |\n",
"| 26 | University of Texas, Austin | Austin | Texas (Austin) | Austin, TX | http://www.utexas.edu/cola/depts/economics/phd... | Владимир Меньшиков | 0 | 228778 | 5.338412e+08 |\n",
"| 27 | Washington University in St. Louis | St. Louis | Missouri (Jefferson City) | St. Louis , MO | http://economics.wustl.edu/ | You might be the first guy from HSE here! | 0 | 179867 | 4.677470e+08 |\n",
"| 28 | Michigan State University | East Lansing | Michigan (Lansing) | East Lansing, MI | http://cj.msu.edu/programs/doctorate/ | You might be the first guy from HSE here! | 0 | 171100 | 3.832870e+08 |\n",
"| 29 | Ohio State University | Columbus | Ohio (Columbus) | Columbus, OH | https://economics.osu.edu/successful-career-ec... | You might be the first guy from HSE here! | 0 | 204796 | 4.932828e+08 |\n", "
\n", "| | University | Location | State | Full Location | Econ Department | Alumni | #Nobel Prizes | University ID | Average Research Expenditure |\n",
"|---|---|---|---|---|---|---|---|---|---|\n",
"| 0 | Harvard University | Cambridge | Massachusetts (Boston) | Cambridge, MA | http://www.economics.harvard.edu/graduate | Мария Воронина, Роман Сигалов | 8 | 166027 | 8.11916e+08 |\n",
"| 1 | Massachusetts Institute of Technology | Cambridge | Massachusetts (Boston) | Cambridge, MA | http://econ-www.mit.edu/graduate/ | You might be the first guy from HSE here! | 10 | 166683 | 1.30443e+09 |\n",
"| 2 | Princeton University | Princeton | New Jersey (Trenton) | Princeton , NJ | http://www.princeton.edu/economics/graduate/ | Анна Белоног, Никита Мельников | 6 | 186131 | 3.01748e+08 |\n",
"| 3 | Stanford University | Stanford | California (Sacramento) | Stanford, CA | http://economics.stanford.edu/graduate | Алина Арефьева, Иван Королев, Алина Арефьева, Даниил Вишнев, Иван Королев, Павел Кривенко, Лина Лукьянцева | 3 | 243744 | 1.07690e+09 |\n",
"| 4 | University of California, Berkeley | Berkeley | California (Sacramento) | Berkeley, CA | http://emlab.berkeley.edu/econ/grad/grad.shtml | Дарья Бахарева, Петр Мартынов, Сергей Стеблёв | 2 | 110635 | 6.47557e+08 |\n",
"| 5 | Yale University | New Haven | Connecticut (Hartford) | New Haven, CT | http://www.econ.yale.edu/graduate/index.htm | Иван Кленовский | 6 | 130794 | 4.9266e+08 |\n",
"| 6 | Northwestern University | Evanston | Illinois (Springfield) | Evanston, IL | http://www.econ.northwestern.edu/phd/index.html | Егор Козлов, Алексей Макарьин, Дмитрий Седов | 2 | 147767 | 4.29166e+08 |\n",
"| 7 | University of Chicago | Chicago | Illinois (Springfield) | Chicago, IL | http://economics.uchicago.edu/graduate.shtml | Юлия Жесткова | 14 | 144050 | 3.28561e+08 |\n",
"| 8 | Columbia University | New York | New York (Albany) | New York, NY | http://econ.columbia.edu/graduate-program | Тимур Аббясов, Анастасия Гоноцкая | 4 | 190150 | 6.53339e+08 |\n",
"| 9 | University of Pennsylvania | Philadelphia | Pennsylvania (Harrisburg) | Philadelphia, PA | http://crim.sas.upenn.edu/ | Эдвард Бахитов, Александр Беляков, Мария Гельруд, Ирина Пименова, Харис Соколов | 2 | 215062 | 7.20534e+08 |\n",
"| 10 | New York University | New York | New York (Albany) | New York, NY | http://econ.as.nyu.edu/page/home | Александра Алферова, Анна Денисенко, Кристина Комиссарова, Василий Русанов, Сергей Санович, Дмитрий Сорокин | 1 | 193900 | 7.72544e+08 |\n",
"| 11 | University of California, Los Angeles | Los Angeles | California (Sacramento) | Los Angeles, CA | http://www.econ.ucla.edu/graduate/ | Степан Алексенко, Иван Лавров, Евгения Назруллаева, Кирилл Пономарев, Арсений Самсонов, Вадим Храмов | 1 | 110662 | 8.23136e+08 |\n",
"| 12 | University of California, San Diego | La Jolla | California (Sacramento) | La Jolla, CA | http://economics.ucsd.edu/grad/index.php | Александр Левкун, Анастасия Файкина | 2 | 110680 | 8.57465e+08 |\n",
"| 13 | University of Michigan, Ann Arbor | Ann Arbor | Michigan (Lansing) | Ann Arbor, MI | http://www.lsa.umich.edu/econ/graduatestudy | You might be the first guy from HSE here! | 0 | 170976 | 8.2185e+08 |\n",
"| 14 | University of Wisconsin, Madison | Madison | Wisconsin (Madison) | Madison, WI | http://www.ssc.wisc.edu/econ/grad | Алина Арефьева, Рената Гайнедденова, Андрей Зубанов, Анна Трубникова | 0 | 240444 | 9.0693e+08 |\n",
"| 15 | Cornell University | Ithaca | New York (Albany) | Ithaca, NY | http://www.economics.cornell.edu/graduate/graduate.html | You might be the first guy from HSE here! | 1 | 190415 | 3.62476e+08 |\n",
"| 16 | Duke University | Durham | North Carolina (Raleigh) | Durham, NC | http://econ.duke.edu/phd-program/degree-reqs | Илья Козис, Екатерина Рощина | 0 | 198419 | 8.62847e+08 |\n",
"| 17 | University of Minnesota, Twin Cities | Minneapolis | Minnesota (St. Paul) | Minneapolis, MN | http://www.econ.umn.edu/graduate/index.html | Константин Голяев, Константин Голяев, Егор Малков, Владимир Смирнягин | 0 | 174066 | 7.794e+08 |\n",
"| 18 | Brown University | Providence | Rhode Island (Providence) | Providence, RI | http://www.brown.edu/Departments/Economics/graduate.php | Сергей Панкратьев, Вячеслав Савицкий, Александр Яркин | 0 | 217156 | 1.13516e+08 |\n",
"| 19 | Carnegie Mellon University | Pittsburgh | Pennsylvania (Harrisburg) | Pittsburgh, PA | http://www.tepper.cmu.edu/doctoral-program/fields-of-study/economics/index.aspx | You might be the first guy from HSE here! | 4 | 211440 | 2.19839e+08 |\n",
"| 20 | University of Maryland, College Park | College Park | Maryland (Annapolis) | College Park, MD | https://ccjs.umd.edu/ | You might be the first guy from HSE here! | 0 | 163286 | 4.43636e+08 |\n",
"| 21 | University of Rochester | Rochester | New York (Albany) | Rochester, NY | http://www.econ.rochester.edu/graduate/ | You might be the first guy from HSE here! | 0 | 195030 | 3.00576e+08 |\n",
"| 22 | Boston University | Boston | Massachusetts (Boston) | Boston, MA | http://www.bu.edu/econ/gradprgms/ | You might be the first guy from HSE here! | 0 | 164988 | 1.81976e+08 |\n",
"| 23 | Johns Hopkins University | Baltimore | Maryland (Annapolis) | Baltimore, MD | http://www.econ.jhu.edu/grad-prog.html | You might be the first guy from HSE here! | 0 | 162928 | 1.34458e+09 |\n",
"| 24 | Boston College | Chestnut Hill | Massachusetts (Boston) | Chestnut Hill, MA | http://fmwww.bc.edu/ec/grad.php | You might be the first guy from HSE here! | 0 | 164924 | 3.62986e+07 |\n",
"| 25 | Pennsylvania State University, University Park | University Park | Pennsylvania (Harrisburg) | University Park, PA | http://sociology.la.psu.edu/graduate/programs/criminology/criminology-program-requirements | Михаил Заварзин, Олег Муратов | 0 | 214777 | 7.70381e+08 |\n",
"| 26 | University of Texas, Austin | Austin | Texas (Austin) | Austin, TX | http://www.utexas.edu/cola/depts/economics/phd/Graduate.php | Владимир Меньшиков | 0 | 228778 | 5.33841e+08 |\n",
"| 27 | Washington University in St. Louis | St. Louis | Missouri (Jefferson City) | St. Louis , MO | http://economics.wustl.edu/ | You might be the first guy from HSE here! | 0 | 179867 | 4.67747e+08 |\n",
"| 28 | Michigan State University | East Lansing | Michigan (Lansing) | East Lansing, MI | http://cj.msu.edu/programs/doctorate/ | You might be the first guy from HSE here! | 0 | 171100 | 3.83287e+08 |\n",
"| 29 | Ohio State University | Columbus | Ohio (Columbus) | Columbus, OH | https://economics.osu.edu/successful-career-economics | You might be the first guy from HSE here! | 0 | 204796 | 4.93283e+08 |\n", "
\n", "| | City | Full Location | pop | income | mean_commute_minutes | primary_care_physicians | motor_vehicle_crash_deaths | health_care_costs | violent_crime | food_environment_index | polution_ppm | Apartment (1 bedroom) Outside of Centre | Meal, Inexpensive Restaurant | Chicken Breasts | Monthly Pass | Basic Utilities monthly for 85 m2 | Internet monthly |\n",
"|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|\n",
"| 0 | Cambridge, MA | Cambridge, MA | 108757 | 83122.0 | 23.4675 | 128.0 | 3.97 | 9121.0 | 206.55 | 8.8 | 9.6 | 1928.21 | 14.00 | 9.03 | 84.50 | 134.00 | 71.62 |\n",
"| 1 | Princeton , NJ | Princeton, NJ | 30168 | 118467.0 | 23.1256 | 104.0 | 7.56 | 10367.0 | 401.89 | 8.1 | 8.9 | 1387.50 | 13.50 | 8.82 | 89.50 | 150.00 | 63.34 |\n",
"| 2 | Stanford, CA | Stanford, CA | 14926 | 52208.0 | 13.0963 | 103.0 | 5.92 | 7797.0 | 253.90 | 8.5 | 8.8 | 2266.67 | 15.00 | 9.51 | 70.00 | 144.17 | 69.18 |\n",
"| 3 | Berkeley, CA | Berkeley, CA | 118585 | 70393.0 | 25.1941 | 107.0 | 5.23 | 8395.0 | 720.31 | 7.7 | 9.0 | 1999.72 | 15.00 | 11.11 | 100.00 | 144.17 | 70.42 |\n",
"| 4 | New Haven, CT | New Haven, CT | 130405 | 38126.0 | 21.7529 | 91.0 | 7.83 | 9554.0 | 381.33 | 7.6 | 8.6 | 1248.22 | 15.00 | 12.10 | 64.50 | 136.00 | 66.67 |\n",
"| 5 | Evanston, IL | Evanston, IL | 75472 | 71317.0 | 27.6540 | 94.0 | 5.72 | 10353.0 | 586.66 | 7.9 | 14.0 | 1116.42 | 15.00 | 10.32 | 105.00 | 131.03 | 61.03 |\n",
"| 6 | Chicago, IL | Chicago, IL | 2714017 | 50434.0 | 32.8802 | 94.0 | 5.72 | 10353.0 | 586.66 | 7.9 | 14.0 | 1116.42 | 15.00 | 10.32 | 105.00 | 131.03 | 61.03 |\n",
"| 7 | New York, NY | New York, NY | 8461961 | 55191.0 | 38.6559 | 66.0 | 4.48 | 10661.0 | 620.74 | 8.2 | 8.6 | 2027.55 | 20.00 | 13.06 | 121.00 | 145.90 | 63.34 |\n",
"| 8 | Philadelphia, PA | Philadelphia, PA | 1559938 | 39770.0 | 31.6483 | 69.0 | 7.18 | 10531.0 | 1094.18 | 6.4 | 11.2 | 1016.93 | 15.00 | 8.90 | 96.00 | 145.61 | 66.85 |\n",
"| 9 | Los Angeles, CA | Los Angeles, CA | 3918872 | 51538.0 | 28.7279 | 72.0 | 6.91 | 11222.0 | 423.79 | 7.9 | 14.4 | 1640.57 | 15.00 | 10.30 | 100.00 | 148.04 | 58.86 |\n",
"| 10 | La Jolla, CA | San Diego, CA | 1374812 | 68117.0 | 21.8709 | 78.0 | 6.94 | 8630.0 | 348.80 | 7.9 | 11.6 | 1552.10 | 15.00 | 8.73 | 72.00 | 132.90 | 65.27 |\n",
"| 11 | Ann Arbor, MI | Ann Arbor, MI | 118087 | 57697.0 | 18.6044 | 176.0 | 6.84 | 9156.0 | 298.89 | 7.3 | 10.5 | 979.92 | 12.00 | 7.13 | 58.00 | 173.53 | 62.27 |\n",
"| 12 | Madison, WI | Madison, WI | 246034 | 56464.0 | 18.4406 | 126.0 | 6.87 | 7353.0 | 230.73 | 8.0 | 9.5 | 926.04 | 12.50 | 10.09 | 65.00 | 115.07 | 53.96 |\n",
"| 13 | Ithaca, NY | Ithaca, NY | 30625 | 30291.0 | 14.6127 | 87.0 | 7.36 | 6796.0 | 122.76 | 7.5 | 8.7 | 816.67 | 14.00 | 8.18 | 45.00 | 186.04 | 62.41 |\n",
"| 14 | Durham, NC | Durham, NC | 251761 | 52115.0 | 21.4121 | 123.0 | 7.92 | 8157.0 | 613.43 | 6.5 | 9.4 | 900.00 | 13.50 | 8.63 | 36.00 | 154.91 | 57.43 |\n",
"| 15 | Minneapolis, MN | Minneapolis, MN | 404670 | 52611.0 | 21.5504 | 117.0 | 4.12 | 7948.0 | 424.22 | 8.0 | 11.7 | 1075.94 | 15.00 | 10.12 | 88.00 | 137.31 | 58.38 |\n",
"| 16 | Providence, RI | Providence, RI | 178851 | 37366.0 | 21.5225 | 95.0 | 5.82 | 8994.0 | 330.98 | 7.3 | 8.3 | 965.91 | 13.00 | 11.01 | 70.00 | 156.22 | 77.50 |\n",
"| 17 | Pittsburgh, PA | Pittsburgh, PA | 305305 | 42450.0 | 22.5548 | 108.0 | 6.59 | 10364.0 | 401.08 | 7.3 | 14.7 | 845.87 | 14.00 | 9.19 | 97.50 | 172.65 | 76.36 |\n",
"| 18 | College Park, MD | College Park, MD | 31942 | 64694.0 | 27.8162 | 52.0 | 10.23 | 8359.0 | 509.39 | 7.3 | 9.9 | 1500.00 | 12.75 | 11.01 | 40.00 | 142.50 | 59.88 |\n",
"| 19 | Rochester, NY | Rochester, NY | 210291 | 31684.0 | 19.5008 | 103.0 | 5.85 | 8289.0 | 344.67 | 7.4 | 9.7 | 859.67 | 15.00 | 12.13 | 49.74 | 116.00 | 55.26 |\n",
"| 20 | Boston, MA | Boston, MA | 658279 | 58516.0 | 29.2652 | 151.0 | 3.84 | 9272.0 | 815.16 | 7.6 | 10.0 | 1742.37 | 15.00 | 13.32 | 84.50 | 156.22 | 62.67 |\n",
"| 21 | Baltimore, MD | Baltimore, MD | 621000 | 44262.0 | 29.4086 | 93.0 | 8.42 | 9825.0 | 1388.61 | 5.9 | 10.3 | 1059.50 | 15.00 | 8.26 | 72.00 | 158.09 | 72.62 |\n",
"| 22 | Chestnut Hill, MA | Boston, MA | 658279 | 58516.0 | 29.2652 | 151.0 | 3.84 | 9272.0 | 815.16 | 7.6 | 10.0 | 1742.37 | 15.00 | 13.32 | 84.50 | 156.22 | 62.67 |\n",
"| 23 | University Park, PA | State College, PA | 42074 | 31618.0 | 15.9115 | 75.0 | 8.43 | 9838.0 | 91.40 | 7.7 | 9.9 | 770.00 | 13.00 | 6.59 | 79.00 | 172.65 | 76.36 |\n",
"| 24 | Austin, TX | Austin, TX | 907779 | 60939.0 | 22.0706 | 85.0 | 9.56 | 10041.0 | 345.75 | 6.6 | 10.0 | 1057.99 | 15.00 | 8.46 | 41.25 | 155.48 | 59.41 |\n",
"| 25 | St. Louis , MO | St. Louis, MO | 316030 | 36809.0 | 23.2125 | 83.0 | 10.60 | 9977.0 | 1702.75 | 4.9 | 10.5 | 711.12 | 12.50 | 6.91 | 74.00 | 181.30 | 53.24 |\n",
"| 26 | East Lansing, MI | East Lansing, MI | 48395 | 34153.0 | 14.6412 | 106.0 | 7.43 | 9656.0 | 552.79 | 6.1 | 10.1 | 979.92 | 12.50 | 6.08 | 33.00 | 174.34 | 58.33 |\n",
"| 27 | Columbus, OH | Columbus, OH | 837038 | 47156.0 | 20.7537 | 101.0 | 8.36 | 9720.0 | 429.31 | 6.6 | 12.3 | 809.83 | 13.00 | 7.49 | 62.00 | 168.99 | 56.73 |\n", "