This is a summary of the paper "Collective intelligence for deep learning: A survey of recent developments," which explores the relationship between deep learning and collective intelligence. Introduction Collective intelligence for deep l…
This is a summary of the GPT-2 paper "Language Models are Unsupervised Multitask Learners." Introduction Language Models are Unsupervised Multitask Learners Overview Method Creating the WebText Training Dataset BPE: Byte Pair Encoding Mode…
This is a quick reference note on the differences between torch.tensor() and torch.Tensor(). Introduction Differences Between torch.tensor and torch.Tensor In Short In Detail Note: Creating Empty Tensors Conclusion References Introduction …
I created my own game using Python. This article documents everything from the preparation involved in game development to the implementation challenges I encountered, and the final game design. Introduction Wanting to Create Something Why…
This article explains how to compare user-defined class objects for equality in Python. Introduction Comparing User-Defined Class Objects for Equality in Python Comparing Instance Variables in the __eq__ Method Comparing __dict__ in the __…
This article explains how to check memory and disk space on your execution environment using Python. Introduction Getting Disk and Memory Capacity Disk Space Memory Capacity Conclusion References Introduction I recently needed to check mem…
I'll verify what happens when calling the OpenAI API from Python with both stream and timeout settings enabled. Introduction Behavior When Combining GPT Stream and Timeout Hypothesis Testing Method Test Results Summary Conclusion Reference…
This article explains how to implement GPT streaming responses using the OpenAI API in Python. Introduction Implementing GPT Stream Responses with OpenAI API Conclusion References Introduction When you call GPT using the OpenAI API, the de…
This is a guide on how to set timeouts when calling the OpenAI API using the OpenAI Python Library. Introduction Setting Timeout in the OpenAI Python Library How to: Using the request_timeout Parameter A Pitfall: The timeout Parameter Conc…
This is a summary of the review paper "Organoid intelligence (OI): the new frontier in biocomputing and intelligence-in-a-dish," which covers developments in Organoid Intelligence (OI). Introduction Organoid intelligence (OI): the new fron…
This is a summary of the first GPT paper, "Improving Language Understanding by Generative Pre-Training." Introduction Improving Language Understanding by Generative Pre-Training Overview Method Results Natural Language Inference Tasks Ques…
This is a summary of the seminal paper "Attention Is All You Need," which introduced the Transformer architecture. Introduction Attention Is All You Need Overview Method Model Architecture Training Method Results Translation Tasks Transfor…
This is a summary of the paper "Evolutionary Optimization of Model Merging Recipes," which describes Sakana.ai's evolutionary model merging approach. Introduction Evolutionary Optimization of Model Merging Recipes Overview Method Results L…
This article explores the differences and similarities between Active Inference from the Free Energy Principle and LLMs (Large Language Models), based on the paper "Predictive Minds: LLMs As Atypical Active Inference Agents." Introduction …
I'll share techniques for reading research papers from Andrew Ng's lecture video. Introduction How to Read Research Papers According to Andrew Ng Reading Papers Checking Your Understanding Conclusion References Introduction Recently, as I'…