Practical work is the goal. Don’t fool yourself into thinking you know what is taught by just watching the videos or reading the book. Watch the video, watch it again but with the tools at hand, pausing to try stuff out, do a project using what you learned. The title of this blog is me trying to remind myself of this concept.
I have my own ML workstation setup with a GPU. I used poetry to set it up. This was a pain due to lack of attention to repeatability in Jupyter Notebooks. It would be very useful if they had a “lock” feature that records all python packages in the environment they are run it and their exact version. Below is what it looks like after finishing Lesson 1 (the next video).
Be tenacious. Finish a project.
One message that’s very close to my heart is not keep on getting ready to do a project, like stopping to learn linear algebra (and then remembering how I never learned all the math terms, and therefore want to go back even deeper) in order to do well on this course. Try to do a complete project, then on the next project go deeper, dig in deeper when the code needs it.
Show your work to the world. Blog not to be a breaking news source, but blog for the audience of yourself 6 months ago.
The Economist covers It doesn’t take much to make machine-learning algorithms go awry. Will we see a core of knowledge built that considered “The Truth”, and then all other input data is evaluated on how likely that’s true based on the givens? LLMs judging what is fed to their younger siblings?
In 2012 I spent a lot of my rare free time working thru both Sebastian Thrun and Peter Norvig’s
Intro to Artificial Intelligence (which I can’t find anymore) and Andrew Ng’s Machine Learning course. Andrew’s was the better of the two.
One thing that blew me away was how he used k-means clustering on our homework submissions to discover where there were a large number of students that had a common misconception, and make a clarification video.
At the time I was working in R&D at Rosetta Stone, and we wanted to bring data science and machine learning to bear on how to improve our language learning offerings. It was a linear course, and we dreamed of building a model of what our learners knew, and then challenging them on what they didn’t know. Duolingo had a much better vision and execution for this.
I have a lot of data on comic books. I’m hoping to build some practical applications for this. One idea is to feed each panel into something like LLaVA and have it describe what’s going on in it, then have it summarize the story.