统计面试题准备 – 最全 AB test interview preparation DS面试复习资料

1)Google has a very good class for AB testing below. It’s on Udacity. If you just want see an example of sample size calculation, this particular part will guide you through the process with click through rate as an example. lesson 1 part 23: quiz ‘Calculating Number of Pages Views Needed’. https://classroom.udacity.com/courses/ud257/lessons/4018018619/concepts/40043987060923 Here is some … Read more

Everyday SQL (13) interview question

We have 3 tables Orders table (ORDER_ID, CUSTOMER_ID, ORDER_DATE) Detail table (ORDER_ID, PRODUCT_ID, PRICE, UNITS) Customers table (CUSTOMER_ID, SIGNUP_DATE, COUNTRY) Data request: can you write a query to calculate % of customers that made an order each month, by signup month. Please return 3 columns: signup month, order month, % of customers Possible solution: with … Read more

Everyday SQL (12)

data: code A1 A2 A11 A22 A111 A222 A3333 A4444   We want to select a data set as follows. A1 A2 A11 A22 A111 A222   Solution: SELECT code FROM data WHERE code LIKE ‘A[0-9]’  — 1 digit OR    code LIKE ‘A[1-9][0-9]’  — 2 digits OR    code LIKE ‘A[1-9][0-9][0-9]’  — 3 digits   We … Read more

Everyday SQL (10) 10 SQL interview MUST PREPARED topics and answers — Data Scientist SQL面试题最全题库

I found there are lots of resource for SQL interview questions. It can help you on refreshing your knowledge of sql. Go through the below list before you go to an interview which might test your sql skill. 1. 练习题库 题库一 https://leetcode.com/problemset/database/ 题库二 https://www.hackerrank.com/domains/sql If we would not use subquery, we can use temporary table. … Read more

Everyday SQL (8) SQL- Date shift by day, week, month, quarter, or year

Add days, weeks, month, quarter, year to a date: use positive number #minus days, weeks, month, quarter, year from a date: use negative number Example # minus 1 week from January 18th, 2019 : processed_date >= dateadd(week, -1, ‘20190118’) #or processed_date >= dateadd(weeks, -1, ‘20190118’) #week and weeks are the same for dateadd function. #you … Read more

Combine excel files with the same format into one – Python code example

import pandas as pd import numpy as np import dateutil import glob file_names=glob.glob(“/Users/XXXX/Downloads/XXXX/u*.xlsx”) combining excel .xlsx files starting with u together all_data = pd.DataFrame() for f in glob.glob(“/Users/XXXXX/Downloads/XXXXX/u*.xlsx”): df = pd.read_excel(f) all_data = all_data.append(df,ignore_index=True) all_data.shape write it to excel writer = pd.ExcelWriter(‘/Users/XXX/Downloads/file/combined_samples.xlsx’) all_data.to_excel(writer) extracting column a=all_data[[“a”]] counting unique value in a column all_data[[“a”]].nunique() taking unique … Read more

长期亚麻内推

长期内推,刚毕业无实习经验无项目经验的的朋友建议先自己投intern,contractor等职位(建议详见http://everydayds.com/2016/09/07/%e5%bc%80%e5%9c%ba%e7%99%bd/),有经验后再联系继续给您内推。 各种职位各个地区都可以,如SDE, Data Scientist, Data Analyst, PM等, 请发简历和自己没有申请过的job ID 到 gmail   liuruilan123@ 职位都可以通过内推系统内推,如果自己已经申请过,请不要发信内推,因为内推系统无法提交。 - 工程PHD,修了个统计学位转的行做数据分析,希望可以帮到你。 Data Analyst 工作日常SQL R code 博客分享,工科转行到IT工作经历。 http://everydayds.com/