• 编程基础
  • GITHUB
  • How to use global variables correctly in Python

    Aug 23, 2019

    I just accomplish an online algorithm test for an interview, one of the problems ask me to travel thought a tree structure and calculate how many paths satisfied the condition. It can be easily solved by traditional DFS algorithm. I answered this (pseudocode):

    ...more
  • How Python read or write files under the hood

    Aug 14, 2019

    Python makes users easy to operate files by hiding the implementation details for the IO module. This article aim in discourse more details about how it works for people who want to know more about it.

    ...more
  • Anti Spam reviews in App store

    Aug 13, 2019

    Back to the 1980s, the NLP systems still based on complex sets of hand-written rules. These rules become hard to maintain since it grows exponential depend on the length of sentences. Moreover, new words pop up every day so we also have to maintain the old roles before developing any new one. Most important, we can’t use these rules on other languages since the rules are languages-specify.
    With the power of the modern computer, we tried machine learning algorithms like decision trees to replace some of the hand-written rules. As we gain more and more data, we start using statistical models like hidden markov models to predict the probability of the word depends on the input data. Nowadays, with the help from the deep neural network, word embeddings help us to convert a word from higher dimension space to a lower one.

    ...more
  • A crash course in computer Cache

    Jun 13, 2019

    Even though the computer run much faster then we expected nowadays, the speed between the different components are still very large. To solve this problem, we introduce different levels of caches. After access the data from storage, we store the most used (temporal locality) and the nearby data (spatial locality) in the current lever so we can aceesee them directly later. The term storage here is relative. For example, main memory is a storage to CPU cache, and the disk is a storage to main memory. When running a program, it may reach several levels of caches. Let’s look closer to it:

    ...more
  • Why we should hire open source contributors

    Sep 25, 2018

    I spent 2 years in a startup as a software engineer, we have 15 engineers and we proud of our products. Like other companies, we desire to hire the best engineers. But hiring is really hard, I remember a popular article in HN, the article quoted from Thomas Ptacek:

    ...more
  • Skip deleted attributes while calling reset_mock

    Sep 22, 2018

    Today I’m going to review pull_9302

    ...more
  • Remove redundant overflow checks

    Sep 21, 2018

    Today I’m going to review pull_8757, My selection criteria it’s the PR should be

    1. Neither too new or too old
    2. Passed most of the tests and been labeled awaiting review
    3. I can review it in a day.

    (Another python contributor Tal Einat gave me so many advises in my yesterday post, Thank you so much, Tal.)

    ...more
  • Coroutine or Coroutine Function

    Sep 19, 2018

    I’m going to review some PR from the Cpython repo this week.

    ...more
  • Why we never thank open source maintainers

    Nov 23, 2017
    ...more
  • How to make Python GIL better (To be finished)

    Jul 16, 2017
    ...more
NEXT

© 2016 - 2019 Unicooo, unless otherwise noted.