Blog

Python_data_science_Lesson_6

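Today's topic is predictive models, starting with linear regression. Linear regression is sometimes described as maximum likelihood estimation, because the least-squares fit is the maximum likelihood estimate when the errors are normally distributed. Fit a line to a data set of observations. Use this line to predict unobserved values. Plotted, a linear regression is just a straight line that stays as close as possible to all the data points. Linear regression can be computed with several algorithms. The usual one is a technique called 'ordinary least squares' (OLS): Minimize the s...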
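As a rough illustration of fitting a line and predicting from it, here is a minimal sketch of the standard closed-form least-squares fit for one variable. The data and the names `fit_line` and `predict` are made up for this example, and it uses only plain Python so it runs without numpy.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of the x/y cross deviations.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(x, slope, intercept):
    """Predict an unobserved value from the fitted line."""
    return slope * x + intercept


# Hypothetical observations that lie exactly on y = 2x + 1.
xs = [0, 1, 2, 3, 4]
ys = [1, 3, 5, 7, 9]

slope, intercept = fit_line(xs, ys)
print(slope, intercept, predict(5, slope, intercept))
```

With noisy data the points no longer sit on the line, and the same formulas give the line with the smallest total squared vertical distance to the points.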

Read more

Python_data_science_Lesson_5

Conditional probability and Bayes' Theorem. These two theories made my head spin. Conditional probability: if I have two events that depend on each other, what's the probability that both will occur? Notation: P(A,B) is the joint probability of A and B both occurring. P(B|A) is the probability of B given that A has already occurred. We know: P(B|A) =...
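To make the notation concrete, here is a small sketch that counts events in made-up records and applies the textbook definitions P(B|A) = P(A,B) / P(A) and Bayes' Theorem P(A|B) = P(B|A) P(A) / P(B). The records and variable names are hypothetical, and `Fraction` is used only to keep the arithmetic exact.

```python
from fractions import Fraction

# Hypothetical records: (event A happened, event B happened).
records = (
    [(True, True)] * 60
    + [(True, False)] * 20
    + [(False, True)] * 5
    + [(False, False)] * 15
)
n = len(records)

p_a = Fraction(sum(1 for a, _ in records if a), n)         # P(A)
p_b = Fraction(sum(1 for _, b in records if b), n)         # P(B)
p_ab = Fraction(sum(1 for a, b in records if a and b), n)  # P(A,B)

# Conditional probability from the definition.
p_b_given_a = p_ab / p_a

# Bayes' Theorem: flip the condition around.
p_a_given_b = p_b_given_a * p_a / p_b

print(p_b_given_a, p_a_given_b)
```

The Bayes result can be cross-checked by counting directly: P(A|B) should equal P(A,B) / P(B).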

Read more

Python_data_science_Lesson_4

Today's topic is covariance and correlation. All the code was typed into ipython3, so the prerequisites are:
ipython3
In [1]: import numpy as np
In [2]: import matplotlib.pyplot as plt
Covariance and correlation give us a means of measuring just how tightly two things are related. Covariance: measures how two variables vary in tandem from their means. Correlation: -1 negative(in...
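As a sketch of what the two measures compute, the snippet below implements sample covariance and Pearson correlation by hand. The data and the function names are made up for illustration; it deliberately avoids numpy so it runs anywhere, which is a choice of this example and not what the post itself does.

```python
import math


def covariance(xs, ys):
    """Sample covariance: average product of deviations from the means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def correlation(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))


# Hypothetical data: ys_up rises with xs, ys_down falls with xs.
xs = [1, 2, 3, 4, 5]
ys_up = [2, 4, 6, 8, 10]
ys_down = [10, 8, 6, 4, 2]

cov_up = covariance(xs, ys_up)
corr_up = correlation(xs, ys_up)
corr_down = correlation(xs, ys_down)
print(cov_up, corr_up, corr_down)
```

Covariance depends on the units of the data, while correlation is normalized to lie between -1 and 1, which is why it is easier to interpret.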

Read more

Python_data_science_Lesson_3

Today's topic is percentiles and moments. Percentiles: in a data set, what's the point at which x% of the values are less than that value? For example, the 90th percentile is the value that is greater than 90% of the data, and the 50th percentile is the median. IQR (interquartile range): when we talk about a distribution, the IQR is the area in the middle of the distribution that contains 50% of the values. #find percentiles v...
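Here is a minimal sketch of computing percentiles and the IQR by hand. It uses linear interpolation between neighbouring sorted values, which is the same rule `numpy.percentile` applies by default; the `percentile` helper and the data are made up for this example.

```python
def percentile(data, p):
    """Value below which roughly p percent of the data falls (0 <= p <= 100)."""
    xs = sorted(data)
    k = (len(xs) - 1) * p / 100      # fractional index into the sorted data
    lower = int(k)                   # floor, since k is never negative
    upper = min(lower + 1, len(xs) - 1)
    # Interpolate between the two neighbouring values.
    return xs[lower] + (xs[upper] - xs[lower]) * (k - lower)


# Hypothetical data set: the integers 1 through 11.
data = list(range(1, 12))

median = percentile(data, 50)
q1 = percentile(data, 25)
q3 = percentile(data, 75)
iqr = q3 - q1                        # width of the middle 50% of the values
print(median, q1, q3, iqr)
```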

Read more

CPU stress testing with prime95

Last time, because the CPU was running hot, I replaced its thermal paste. I wanted to see when the overheating and automatic CPU throttling would be triggered again.
dmesg | grep cpu
[22115.380360] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[22115.380362] CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[22115.380364] CPU2: Core temperature above threshold, cpu cl...
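When a stress test runs for a long time, counting the throttling messages per CPU is easier than reading them one by one. The sketch below is one way to do that. The regular expression is an assumption based only on the two complete sample lines above, not on a documented kernel log format, and `count_throttle_events` is a name invented for this example.

```python
import re
from collections import Counter

# Assumed pattern, derived only from the sample dmesg lines in this post.
THROTTLE_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+(?P<cpu>CPU\d+): "
    r"(?P<scope>Core|Package) temperature above threshold, cpu clock throttled"
)


def count_throttle_events(lines):
    """Count throttling messages per CPU in an iterable of dmesg lines."""
    counts = Counter()
    for line in lines:
        match = THROTTLE_RE.search(line)
        if match:
            counts[match.group("cpu")] += 1
    return counts


sample = [
    "[22115.380360] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)",
    "[22115.380362] CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)",
    "[22115.400000] usb 1-1: new high-speed USB device",  # hypothetical unrelated line
]

throttled = count_throttle_events(sample)
print(dict(throttled))
```

In practice the lines would come from the saved output of the `dmesg` command instead of a hard-coded list.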

Read more

Notes on a problem encountered during a kali_linux_distupgrade

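Today's lesson learned: I had not upgraded kali in a long time and wanted to bring it up to date.
apt-get update
apt-get dist-upgrade
apt-get upgrade
The upgrade went smoothly and the login page appeared, but the system froze right after I entered the root password. Thinking it through: the login screen shows up but the system hangs once the root password is entered, so the problem is probably in GNOME or another GUI component. I held the power button to shut down and booted into recovery mode. First, bring up networking:
service network-manager start
Without overthinking it, I uninstalled GNOME entirely first.
apt-get purge gnome
apt-get purge gnome*
After rebooting into the system, you can directly Ctrl + Alt +...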

Read more

Python_data_science_Lesson_2

The main book I am reading is Hands-On Data Science and Python Machine Learning by Frank Kane. Today's topic is PMF and PDF, which really just means data distributions. The code alone is not very intuitive, but once the plots are drawn with matplotlib it becomes very intuitive. Terminology difference: A probability density function is a solid curve that describes the probability of a range of values happening with continuous data. A probability mass function is the...
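To show the difference in code, here is a small sketch: an empirical PMF built by counting discrete values, and the standard normal PDF evaluated at a point. The sample data and the names `pmf` and `normal_pdf` are made up for this example, and plotting is left out so the snippet has no dependencies.

```python
import math
from collections import Counter


def pmf(samples):
    """Empirical probability mass function: share of samples at each value."""
    counts = Counter(samples)
    total = len(samples)
    return {value: count / total for value, count in counts.items()}


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density of the normal distribution with mean mu and std dev sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))


# Hypothetical discrete observations.
rolls = [1, 2, 2, 3, 3, 3]
roll_pmf = pmf(rolls)

# Height of the standard normal curve at its peak.
peak = normal_pdf(0.0)
print(roll_pmf, peak)
```

A PMF value is a probability in its own right, so the values sum to 1. A PDF value is a density: probabilities come from the area under the curve over a range, so a single point has probability zero.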

Read more

Python_data_science_Lesson_1

Data Science: I plan to write a series of notes recording what I learn from books and videos. This is lesson 1: Mean, Mode, Median. Mean, AKA average: sum / number of samples. Median: sort the values and take the value at the midpoint; for an even number of values, take the average of the two middle values. Mode: the most common value in a data set, meaning the value that occurs most often. ...
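The three definitions map directly onto Python's standard `statistics` module. This is a minimal sketch with made-up sample values; `multimode` (Python 3.8+) is used because a data set can have more than one most common value.

```python
from statistics import mean, median, multimode

# Hypothetical sample data, chosen only for illustration (8 values).
values = [1, 2, 2, 3, 4, 7, 9, 2]

avg = mean(values)         # sum / number of samples
mid = median(values)       # even count: average of the two middle values
modes = multimode(values)  # list of the most common value(s)

print(avg, mid, modes)
```

Because this sample has an even number of values, the median falls between the two middle values of the sorted list instead of on one of them.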

Read more