Focus AI, the latest AI news in only 3 minutes !
This week:
🔥 Introducing ChatGPT and Whisper APIs
🗺️ The 2023 MAD (Machine Learning, Artificial Intelligence & Data) Landscape
🪖 New Rules for Military AI
👁️🗨️ From Pandemic to Panopticon
And about code:
📚 [Product Launch] Introducing Kaggle Models
📊 PyGWalker
🤿 dstack, reproducible ML workflows
🛀 Data cleaning for data sharing
💪 Intro to Reinforcement Learning Tutorial | Kaggle
🐻❄️ Awesome Polars
🏋️ Awesome DS Setting
🐍 Google Python Style Guide
👩🏫 Course: Data Visualization Fundamentals and Best Practices
== News ==
🔥 Introducing ChatGPT and Whisper APIs
ChatGPT and Whisper models are now available on our API, giving developers access to cutting-edge language (not just chat!) and speech-to-text capabilities. Through a series of system-wide optimizations, they’ve achieved 90% cost reduction for ChatGPT since December; they’re now passing through those savings to API users.
🗺️ The 2023 MAD (Machine Learning, Artificial Intelligence & Data) Landscape
The annual MAD (Machine Learning, Artificial Intelligence and Data) landscape is our attempt at making sense of this vibrant space. Its general philosophy has been to open source work that we would do anyway, and start a conversation with the community.
So, here we are again, in 2023. This is our ninth annual landscape and “state of the union” of the data and AI ecosystem.
🪖 New Rules for Military AI
Nations tentatively agreed to limit their use of autonomous weapons.
Representatives of 60 countries endorsed a nonbinding resolution that calls for responsible development, deployment, and use of military AI. Parties to the agreement include China and the United states but not Russia.
👁️🗨️ From Pandemic to Panopticon
Governments are repurposing Covid-focused face recognition systems as tools of repression.
Russia’s internal security forces are using Moscow’s visual surveillance system, initially meant to help enforce pandemic-era restrictions, to crack down on anti-government dissidents or protestors against the war in Ukraine, Wired reported.
ps: if you like this content, feel free to share it or to contribute with a small (or large) gift subscription)
== Code & Tech ==
📚 [Product Launch] Introducing Kaggle Models
You’ve heard of Kaggle Datasets. And you know Kaggle Competitions. Today, meet the newest addition: Kaggle Models! Kaggle Models is where you will discover and use pretrained models through deep integrations with the rest of Kaggle’s platform.
Pretrained models define the current paradigm for doing ML. With a dedicated hub for Models, using pretrained models in Competitions will become easier and the community will in turn create and capture more of the knowledge about models.
📊 PyGWalker
Turn your pandas dataframe into a Tableau-style User Interface for visual analysis.
🤿 dstack, reproducible ML workflows
dstack is an open-source tool that allows running reproducible ML workflows independently of infrastructure. It allows running ML workflows locally or remotely, using any configured cloud vendor.
🛀 Data cleaning for data sharing
What steps do we expect someone to take before handing off a dataset that is considered “clean”? And I will tell you right now, there is no standard answer for this question.
💪 Intro to Reinforcement Learning Tutorial | Kaggle
Stone Tao, co-founder of Lux AI, walks through his reinforcement learning tutorial – it's a perfect quick start into how RL generally works, and how to program a basic agent.
🐻❄️ Awesome Polars
A curated list of Polars docs, talks, tools, examples & articles the internet has to offer.
Polars is a lightning-fast DataFrame library for Rust, Python, Node.js and R.
Implemented in Rust, Polars uses Apache Arrow Columnar Format as the memory model.
🏋️ Awesome DS Setting
After setting/reinstalling a couple of machines from scratch in the last few months, I decided for once and for all to document my default data science settings and tools I typically used.
That includes installing programming languages such as R, Julia, and Python and their supporting IDEs RStudio and VScode. In addition, set the terminal, git, and install supporting tools such as iTerm2, oh-my-zsh, Docker, etc.
🐍 Google Python Style Guide
Python is the main dynamic language used at Google. This style guide is a list of dos and don’ts for Python programs.
👩🏫 Course: Data Visualization Fundamentals and Best Practices
When do you use a bar chart over a line chart? What are area charts good for? What's wrong with pie charts? Learn about how these different types of data visualization work, and how they're used, in Observable's first data visualization course!
#ChatGPT #Kaggle #DataViz #Polars
A big thanks 😙 to our sources: https://jack-clark.net/, https://www.actuia.com/, https://thevariable.com/news/, https://techcrunch.com/, https://read.deeplearning.ai/the-batch/
What about you? Have you noticed something else?
Don’t miss the next news!
Have a good week-end,
Maxime 🙃 from Toulouse, France with 🌺