Sveriges 100 mest populära podcasts
Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker)
Dirk-Jan Verdoorn - Data Scientist at Dept Agency
Kubeflow. (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable. Our goal is not to recreate other services, but to provide a straightforward way to deploy best-of-breed open-source systems for ML to diverse infrastructures. Anywhere you are running Kubernetes, you should be able to run Kubeflow.
TensorFlow Extended (TFX). If using TensorFlow with Kubeflow, combine with TFX for maximum power. (From the website:) TensorFlow Extended (TFX) is an end-to-end platform for deploying production ML pipelines. When you're ready to move your models from research to production, use TFX to create and manage a production pipeline.
Alternatives:
Airflow MLflowChatting with co-workers about the role of DevOps in a machine learning engineer's life
Expert coworkers at Dept
Matt Merrill - Principal Software Developer Jirawat Uttayaya - DevOps Lead The Ship It Podcast (where Matt features often)Devops tools
Terraform AnsiblePictures (funny and serious)
Which AWS container service should I use? A visual guide on troubleshooting Kubernetes deployments Public Cloud Services Comparison Killed by Google aCloudGuru AWS curriculum(Optional episode) just showcasing a cool application using machine learning
Dept uses Descript for some of their podcasting. I'm using it like a maniac, I think they're surprised at how into it I am. Check out the transcript & see how it performed.
Descript The Ship It Podcast How to ship software, from the front lines. We talk with software developers about their craft, developer tools, developer productivity and what makes software development awesome. Hosted by your friends at Rocket Insights. AKA shipit.io Brandbeats Podcast by BASIC An agency podcast with views on design, technology, art, and culture. Explore the new microsite at www.brandbeats.basicagency.comShow notes: ocdevel.com/mlg/mla-17
Developing on AWS first (SageMaker or other)
Consider developing against AWS as your local development environment, rather than only your cloud deployment environment. Solutions:
Stick to AWS Cloud IDEs (Lambda, SageMaker Studio, Cloud9Connect to deployed infrastructure via Client VPN
Terraform example YouTube tutorial Creating the keys LocalStackInfrastructure as Code
Terraform CDK ServerlessMLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.)
SageMaker Jumpstart Deploy Pipelines Monitor Kubernetes NeoMLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.)
SageMaker DataWrangler Feature Store Ground Truth Clarify Studio AutoPilot Debugger Distributed TrainingAnd I forgot to mention JumpStart, I'll mention next time.
L1/L2 norm, Manhattan, Euclidean, cosine distances, dot product
Normed distances link
A norm is a function that assigns a strictly positive length to each vector in a vector space. link Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below. L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space. L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2. sqrt(dot-product). Straight-line distance; min distance (Pythagorean triangle edge) Others: Mahalanobis, Chebyshev (p=inf), etcDot product
A type of inner product.Cosine (normalized dot)
Hyperparameters part 2: hyper-search, regularization, SGD optimizers, scaling. ocdevel.com/mlg/28 for notes and resources
ocdevel.com/mlg/12 for notes and resources
Show notes at ocdevel.com/mlg/2
Updated! Skip to [00:29:36] for Data Science (new content) if you've already heard this episode.
What is artificial intelligence, machine learning, and data science? What are their differences? AI history.
Hierarchical breakdown: DS(AI(ML)). Data science: any profession dealing with data (including AI & ML). Artificial intelligence is simulated intellectual tasks. Machine Learning is algorithms trained on data to learn patterns to make predictions.
Artificial Intelligence (AI) - WikipediaOxford Languages: the theory and development of computer systems able to perform tasks that normally require human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages.
AlphaGo Movie, very good!
Reasoning, problem solving Knowledge representation Planning Learning Natural language processing Perception Motion and manipulation Social intelligence General intelligenceApplications
Autonomous vehicles (drones, self-driving cars) Medical diagnosis Creating art (such as poetry) Proving mathematical theorems Playing games (such as Chess or Go) Search engines Online assistants (such as Siri) Image recognition in photographs Spam filtering Prediction of judicial decisions Targeting online advertisements Machine Learning (ML) - WikipediaOxford Languages: the use and development of computer systems that are able to learn and adapt without following explicit instructions, by using algorithms and statistical models to analyze and draw inferences from patterns in data.
Data Science (DS) - WikipediaWikipedia: Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains. Data science is related to data mining, machine learning and big data.
History Greek mythology, Golums First attempt: Ramon Lull, 13th century Davinci's walking animals Descartes, Leibniz1700s-1800s: Statistics & Mathematical decision making
Thomas Bayes: reasoning about the probability of events George Boole: logical reasoning / binary algebra Gottlob Frege: Propositional logic 1832: Charles Babbage & Ada Byron / Lovelace: designed Analytical Engine (1832), programmable mechanical calculating machines1936: Universal Turing Machine
Computing Machinery and Intelligence - explored AI! 1946: John von Neumann Universal Computing Machine 1943: Warren McCulloch & Walter Pitts: cogsci rep of neuron; Frank Rosemblatt uses to create Perceptron (-> neural networks by way of MLP)50s-70s: "AI" coined @Dartmouth workshop 1956 - goal to simulate all aspects of intelligence. John McCarthy, Marvin Minksy, Arthur Samuel, Oliver Selfridge, Ray Solomonoff, Allen Newell, Herbert Simon
Newell & Simon: Hueristics -> Logic Theories, General Problem Solver Slefridge: Computer Vision NLP Stanford Research Institute: Shakey Feigenbaum: Expert systems GOFAI / symbolism: operations research / management science; logic-based; knowledge-based / expert systems 70s: Lighthill report (James Lighthill), big promises -> AI Winter90s: Data, Computation, Practical Application -> AI back (90s)
Connectionism optimizations: Geoffrey Hinton: 2006, optimized back propagation Bloomberg, 2015 was whopper for AI in industry AlphaGo & DeepMindShow notes: ocdevel.com/mlg/1. MLG teaches the fundamentals of machine learning and artificial intelligence. It covers intuition, models, math, languages, frameworks, etc. Where your other ML resources provide the trees, I provide the forest. Consider MLG your syllabus, with highly-curated resources for each episode's details at ocdevel.com. Audio is a great supplement during exercise, commute, chores, etc.
MLG, Resources Guide Gnothi (podcast project): website, Github Tyler's Battlestation What is this podcast? "Middle" level overview (deeper than a bird's eye view of machine learning; higher than math equations) No math/programming experience requiredWho is it for
Anyone curious about machine learning fundamentals Aspiring machine learning developersWhy audio?
Supplementary content for commute/exercise/chores will help solidify your book/course-workWhat it's not
News and Interviews: TWiML and AI, O'Reilly Data Show, Talking machines Misc Topics: Linear Digressions, Data Skeptic, Learning machines 101 iTunesU issuesPlanned episodes
What is AI/ML: definition, comparison, history Inspiration: automation, singularity, consciousness ML Intuition: learning basics (infer/error/train); supervised/unsupervised/reinforcement; applications Math overview: linear algebra, statistics, calculus Linear models: supervised (regression, classification); unsupervised Parts: regularization, performance evaluation, dimensionality reduction, etc Deep models: neural networks, recurrent neural networks (RNNs), convolutional neural networks (convnets/CNNs) Languages and Frameworks: Python vs R vs Java vs C/C++ vs MATLAB, etc; TensorFlow vs Torch vs Theano vs Spark, etc