This is a selected list of stuff i read and found interesting recently. It will be updated regulary(whatever that means).
It could be a link to a paper, a repo or a blog post. The selection is highly subjective, reflects my interests and depends on my reading habit, time, people i follow and recommender algorithms from the platforms i visit.
I am not responsible for the linked content and it does not reflect my personal opinion
- 15.01.2024 - Understanding and Coding Attention in LLMs [Blogpost]
- 25.12.2023 - Stop Resampling [Blogpost]
- 19.12.2023 - LLM Course [Repo]
- 17.12.2023 - Advanced RAG Techniques: an Illustrated Overview [Blogpost]
- 19.11.23 - Practical Tips for Finetuning LLMs Using LoRA [Blogpost]
- 10.10.23 - Transportation Energy Emission Reduction with AI [Blogpost]
- 07.10.23 - Retrieval meets Long Context Large Language Models [Paper]
- 05.10.23 - Challenges in evaluating AI systems [Blogpost]
- 03.10.23 - Unmoderated european LLM Mistral AI [Website]
- 02.10.23 - OpenAI Cookbook [Repo]
- 25.09.23 - Nougat: Neural Optical Understanding [Paper]
- 19.09.23 - Stable Audio by Stability AI [Website]
- 14.09.23 - RAG on GCP with PaLM and LangChain [Blogpost]
- 14.09.23 - RAG vs Finetuning [Blogpost]
- 08.09.23 - Times AI 100 [Article]
- 07.09.23 - Summary on new Foundation Models LLaMA2 and CodeLLaMA [Blogpost]
- 07.09.23 - GPT Pilot - An LLM driven dev tool to create Apps [Repo]
- 04.09.23 - Instruction Tuning for Large Language Models: A Survey [Paper]
- 02.09.23 - SynthID for Watermarks in AI generated content [Article]
- 11.08.23 - Generative Agent Simulation is open source [Repo]
- 08.08.23 - AI mitigates the climate impact of contrails [Blogpost]
- 11.06.23 - The ChatGPT revolution is another tech fantasy [Blogpost]
- 31.07.23 - Generative Agents for your help [Repo]
- 22.07.23 - E2E Preprocessing on GCP [Blogpost]
- 18.07.23 - Vodafone: A DevOps approach to AI/ML [Blogpost]
- 07.07.23 - Stop autonomous cars with Traffic Cones [Article]
- 11.06.23 - Data Engineering with Dataflow and Vertex AI [Blogpost]
- 23.06.23 - GPT4 is a mixture of smaller models [Blogpost]
- 11.06.23 - Practical steps to reduce hallucination for LLM [Blogpost]
- 07.06.23 - Direct Preference Optimization [Paper]
- 02.06.23 - Metas Segment anything Model(SAM) [Repo]
- 01.06.23 - Statement on AI risk backed by OpenAI, Deepmind [Website]
- 30.05.23 - The Diffusion Camera [Website]
- 27.05.23 - AI created Trailer for LOTR in Wes Anderson style [Blogpost]
- 26.05.23 - Good performing LLM’s without RHLF? [Paper]
- 26.05.23 - Don’t get distracted by the hype around AI [Blogpost]
- 19.05.23 - Get started with LangChain [Blogpost]
- 12.05.23 - Stable Animation [Website]
- 14.04.23 - Generative Agents: Interactive Simulacra of Human Behavior [Paper]
- 12.04.23 - Sparks of AGI [Paper]
- 01.04.23 - Pope Francis the Rapper [Article]
- 30.03.23 - BloombergGPT [Website]
- 29.03.23 - Open Letter for Pause for A.I. [Article]
- 20.03.23 - Stanford created a LLM for 600$ [Article]
- 18.03.23 - ChatGPT vs GPT4 [Blogpost]
- 17.03.23 - How Vodafone Uses TensorFlow Data Validation [Blogpost]
- 16.03.23- MLOps Zoomcamp - Free Training [Repo]
- 16.03.23- Data Engineering Zoomcamp - Free Training [Repo]
- 15.03.23 - Plot Neural Net in Latex [Repo]
- 15.03.23 - GPT4 Developer Livestream [Video]
- 10.03.23 - LLM’s explained for nontechnical people [Blogpost]
- 27.02.23 - Prompt Engineering Guide [Repo]
- 07.02.23 - Google Research, 2022 & beyond: ML & computer systems [Blog]
- 07.02.23 - Stable Attribution - find biggest attribution to your diffusion image from model training [Website]
- 05.02.23 - Transformer Model Catalog [Blog]
- 02.02.23 - Constitutional AI [Paper]
- 02.02.23 - What happens when LLMs run out of human written text [Article]
- 26.01.23 - Plant undetectable Backdoors in ML Models [Paper]
- 23.01.23 - Google Research, 2022 & Beyond: Language, Vision and Generative Models [Blog]
- 22.01.23 - Deep Learning Tuning Playbook [Repo]
- 22.01.23 - ChatGPT used Sweatshop in Kenya [Article]
- 21.01.23 - Andrej Karpathy builds a GPT from Scratch [Video]
- 20.01.23 - Pytorch vs Tensorflow in 2023 [Blog]
- 18.01.23 - Why you should use Google JAX [Blog]
- 18.01.23 - 24 embarassing hours for AI [Blog]
- 16.01.23 - Why do tree-based models still outperform deep learning on tabular data? [Paper]
- 16.01.23 - Transformer Inference Optimization [Blog]
- 15.01.23 - GPTZero [Website]
- 11.01.23 - Open Source ChatGPT [Repo]
- 05.01.23 - Best Papers in 2022 [Repo]