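Python_data_science_Lesson_6
Today's topic is the predictive model.
Linear Regression: least-squares fitting, AKA maximum likelihood estimation when the errors are assumed to be normally distributed.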
Fit a line to a data set of observations.
Use this line to predict unobserved values.
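When plotted, a linear regression is simply a straight line that passes as close as possible to all the data points.
Linear regression can be computed with several different algorithms.
Linear Regression uses a technique called ‘ordinary least squares’: OLS
least squares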
Minimize the s...
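As a rough illustration of fitting a line with ordinary least squares and then predicting an unobserved value, here is a minimal sketch in plain Python. The helper name `fit_line` and the sample data are hypothetical and not taken from the lesson; the slope and intercept use the standard closed-form solution for a single input variable.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the ordinary-least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Closed-form OLS for one variable:
    # slope = sum((x - mean_x) * (y - mean_y)) / sum((x - mean_x) ** 2)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical observations that lie exactly on y = 2x + 1
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)

# Use the fitted line to predict a value that was not observed (x = 5)
predicted = slope * 5.0 + intercept
```

With noisy data the line will not pass through every point; the same formulas then give the line with the smallest total squared vertical distance to the points.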
Python_data_science_Lesson_5
conditional probability and Bayes’ Theorem
These two theories make my head spin.
conditional probability
If I have two events that depend on each other, what’s the probability that both will occur?
Notation:
P(A,B) is the joint probability of A and B both occurring.
P(B|A): probability of B given A has already occurred.
we know:
P(B|A) =...
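To make the notation concrete, the sketch below estimates these quantities from counts, using the standard definition of conditional probability, P(B|A) = P(A,B) / P(A). The trial data is hypothetical and chosen only so the numbers are easy to check by hand.

```python
# Hypothetical outcomes: each tuple is (A occurred, B occurred)
trials = (
    [(True, True)] * 30
    + [(True, False)] * 20
    + [(False, True)] * 10
    + [(False, False)] * 40
)

n = len(trials)

# P(A): fraction of trials in which A occurred
p_a = sum(1 for a, _ in trials if a) / n

# P(A,B): fraction of trials in which both A and B occurred
p_a_and_b = sum(1 for a, b in trials if a and b) / n

# P(B|A): probability of B among only the trials where A occurred
p_b_given_a = p_a_and_b / p_a
```

Here A occurs in 50 of 100 trials and both occur in 30, so B happens in 30 of the 50 trials where A happened.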
Python_data_science_Lesson_4
Today's topic is covariance and correlation.
All the code was typed in ipython3, so the prerequisites are as follows:
ipython3
In [1]: import numpy as np
In [2]: import matplotlib.pyplot as plt
They give us a means of measuring just how tightly two variables are correlated.
covariance: Measures how two variables vary in tandem from their means.
correlation: -1 negative(in...
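The two measures can be written out directly from their definitions. This is a minimal sketch in plain Python rather than the lesson's own code; the function names and data are hypothetical, and it uses the sample covariance (dividing by n - 1). NumPy offers `np.cov` and `np.corrcoef` for the same purpose.

```python
from math import sqrt


def covariance(xs, ys):
    """Sample covariance: how two variables vary in tandem from their means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def correlation(xs, ys):
    """Covariance normalized by both standard deviations, so it lies in [-1, 1]."""
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))


# Hypothetical data: ys_up rises with xs, ys_down falls as xs rises
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys_up = [2.0, 4.0, 6.0, 8.0, 10.0]
ys_down = [10.0, 8.0, 6.0, 4.0, 2.0]

cov_up = covariance(xs, ys_up)
corr_up = correlation(xs, ys_up)      # perfectly positively correlated
corr_down = correlation(xs, ys_down)  # perfectly negatively correlated
```

Covariance depends on the units of the data, which makes it hard to interpret on its own; dividing by the standard deviations removes the units.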
Python_data_science_Lesson_3
Today's topic is percentiles and moments.
percentiles: in a data set, the point below which x% of the values fall.
For example, the 90th percentile is the value that is greater than 90% of the values, and the 50th percentile is the median.
IQR: interquartile range
When we talk about a distribution, the IQR is the area in the middle of the distribution that contains 50% of the values.
#find percentiles
v...
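As a small stand-alone illustration of percentiles and the IQR (not the lesson's own code, which is cut off above), the sketch below uses the standard library's `statistics.quantiles` on a hypothetical data set. The `"inclusive"` method interpolates linearly between data points, treating the data as including its own minimum and maximum.

```python
import statistics

values = [1, 2, 3, 4, 5, 6, 7, 8, 9]  # hypothetical data set

# Cut points that split the data into 4 equal parts: 25th, 50th, 75th percentiles
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# IQR: the width of the middle 50% of the values
iqr = q3 - q1

# Cut points that split the data into 10 parts; the 9th one is the 90th percentile
deciles = statistics.quantiles(values, n=10, method="inclusive")
p90 = deciles[8]
```

For this data the 25th, 50th and 75th percentiles are 3, 5 and 7, so the IQR is 4.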
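CPU stress testing with prime95
Because of the CPU's overheating problem last time, I replaced the CPU's thermal paste. I wanted to see when the overheating and automatic CPU throttling problem could be triggered again.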
dmesg | grep cpu
[22115.380360] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[22115.380362] CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[22115.380364] CPU2: Core temperature above threshold, cpu cl...
Notes on a problem encountered during a kali_linux_distupgrade
Today's lesson learned: I hadn't upgraded Kali in a long time and wanted to upgrade it.
apt-get update
apt-get dist-upgrade
apt-get upgrade
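The upgrade went smoothly and the login screen appeared, but the system froze right after I entered the root password. Thinking it through: the login screen shows up, but the system hangs once the root password is entered, so the problem is probably in GNOME or another GUI component.
I held the power button to shut down, then booted into recovery mode.
First, bring up networking: service network-manager start
Without overthinking it, first uninstall GNOME entirely.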
apt-get purge gnome
apt-get purge gnome*
After rebooting into the system, you can directly press Ctrl + Alt +...
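Python_data_science_Lesson_2
####The main book I am reading is Hands-On Data Science and Python Machine Learning, by Frank Kane.
Today's topic is PMF and PDF, which is really just data distribution. The code alone is not very intuitive, but once the plots are drawn with matplotlib it becomes very intuitive.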
Terminology difference: A probability density function is a solid curve that describes the probability of a range of values happening with continuous data. A probability mass
function is the...
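The difference in terminology can be sketched with the standard library alone. This is an illustrative example under assumed data, not the book's code: a PMF is built from counts of hypothetical discrete observations, and a standard normal distribution stands in for the continuous case.

```python
from collections import Counter
from statistics import NormalDist

# Discrete data: the PMF gives the probability of each exact value.
rolls = [1, 2, 2, 3, 3, 3]  # hypothetical discrete observations
counts = Counter(rolls)
pmf = {value: count / len(rolls) for value, count in counts.items()}

# Continuous data: the PDF gives a density at a point, and probabilities
# come from ranges of values (differences of the cumulative distribution).
standard_normal = NormalDist(mu=0.0, sigma=1.0)
density_at_zero = standard_normal.pdf(0.0)
prob_within_one_sigma = standard_normal.cdf(1.0) - standard_normal.cdf(-1.0)
```

The PMF values sum to 1, while the PDF value at a single point is a density rather than a probability; only the area over a range, such as within one standard deviation of the mean, is a probability.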
Python_data_science_Lesson_1
####Data Science: I plan to write a series of notes recording what I learn from books and videos.
Today is Lesson 1.
Mean, Mode, Median.
Mean AKA Average: sum / number of samples
Median: sort the values and take the value at the midpoint; for an even number of values,
take the average of the two middle values.
Mode: the most common value in a data set, that is, the value that occurs most often.
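The three definitions above can be checked quickly with Python's standard `statistics` module. The data set is hypothetical and has an even number of values, so the median is the average of the two middle values.

```python
import statistics

values = [1, 2, 2, 3, 4, 7]  # hypothetical data set with an even count

mean = statistics.mean(values)      # sum / number of samples = 19 / 6
median = statistics.median(values)  # average of the two middle values, 2 and 3
mode = statistics.mode(values)      # the value that occurs most often
```

Because of the single large value 7, the mean (about 3.17) is pulled above the median (2.5), which is why the median is often preferred for skewed data.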
...