- 빅데이터/인공지능
- Emerging LLM Application Architecture
- A cheat sheet explanation of how Large Language Models work:
- Anthropic, Claude Instant 1.2 출시 (anthropic.com)
- Nvidia reveals new A.I. chip, says costs of running LLMs will ‘drop significantly’
- Candle is a minimalist ML framework for Rust with a focus on performance (including GPU support) and ease of use. Try our online demos: whisper, llama2.
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
- Making AMD GPUs competitive for LLM inference
- Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Models to Unique Applications
- WizardLM is on fire! Seems they released a new model, WizardMath 1 hour ago, that outperforms chatgpt on math skills:
- Tutor-GPT is a LangChain LLM application. It dynamically reasons about your learning needs and updates its own prompts to best serve you.
- Personal co-pilot trained on top 10 HF repos by stars using QLoRA in colab on 1 A100 40GB going brrr in vs code 🔥🧑🏽💻🚀
- nanoLoRA - A Minimalistic Implementation of Low-Rank Adaptation
- Create Your Own Custom LLM Chatbot - Impressive step-by-step tutorial explaining how to choose the best LLM and the components needed for building your own custom LLM-powered chatbot.
- Gartner Identifies Top Trends Shaping the Future of Data Science and Machine Learning
- Towards Generalist Biomedical AI
- Follow Anything: Open-set detection, tracking, and following in real-time
- 언어데이터과학 (2023학년도 2학기, 서울대학교 언어학과)
- 딜로이트 , ‘인공지능 활용서' 발간...소비자 부문, 에너지·자원, 산업재, 금융 등 6대 산업군 AI 활용 사례 및 이점 분석
- StabilityAI, 코드를 위한 LLM 생성형 AI "StableCode" 릴리즈 (stability.ai)
- 🐍📰 Prompt Engineering: A Practical Example
- Ask like a human: Implementing semantic search on Stack Overflow
- Beyond prompting: getting production quality LLM performance with Snorkel Flow
- Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering
- Rift is open-source infrastructure for AI-native development environments. Rift makes your IDE agentic.
- FlagEmbedding can map any text to a low-dimensional dense vector which can be used for tasks like retrieval, classification, clustering, or semantic search. And it also can be used in vector database for LLMs.
- Anomaly Detection in Time Series using ChatGPT
- 사무원은 가고 마법사가 왔다··· 생성형 AI가 바꿔내는 데이터베이스 분야
- One-Click Observability(LlamaIndex)
- Generative Agents: Interactive Simulacra of Human Behavior
- 🦜⚒️Q&A System Correctness 🧠
- Building LLM applications for production
- Getting Started With LLMs(Python · Kaggle - LLM Science Exam)
- Welcome to , the world's most extensive scholarly knowledge graph with over 26 billion RDF triples.
- SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples
- pgvecto.rs is a Postgres extension that provides vector similarity search functions.
- Here is how you can obtain a massive speedup with llama-v2 models, much faster than anything else I tried.
- Getting from Generative AI to Trustworthy AI: What LLMs might learn from Cyc
- MS, 엔비디아 H100 GPU 서비스 정식 출시
- LanceDB is an open-source database for vector-search built with persistent storage, which greatly simplifies retrevial, filtering and management of embeddings.
- devlooper is a program synthesis agent that autonomously fixes its output by running tests!
- Has Progress on Data, Analytics, and AI Stalled at Your Company?
- Using Xorbits Inference to Deploy Local LLMs - in 3 steps!
- GN⁺: 2023년의 AI 현황: 생성형 AI의 획기적인 해 by 맥킨지 (mckinsey.com)
- A Bicycle for the (AI) Mind: GPT-4 + Tools
- Getting good results by filtering some public datasets. You'll find lots of duplicates. Filter by instruction similarity score > .95 (cosine) using e5-large-v2.
- A Novel Approach for Anomaly Detection Using Large Language Models
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
- ChainForge - 프롬프트 엔지니어링을 위한 비쥬얼 프로그래밍 도구 오픈소스 (chainforge.ai)
- AWS 생성 AI 플랫폼 '세이지메이커'와 '베드락' 차이는
- CTranslate2 is a C++ and Python library for efficient inference with Transformer models.
- Custom instructions for ChatGPT
- ChatGPT Custom Instructions
- Advances in document understanding (by Google) - Visually Rich Document Understanding (VRDU) dataset
- Stability AI launches StableCode, an LLM for code generation
- S프롬프트 엔지니어링으로 재탄생하는 프로그래밍 문화
- 조직의 생성형 AI 성패, CIO 어깨에 달렸다
- gpt-llm-trainer - Simply input a description of your task, and the system will generate a dataset from scratch, parse it into the right format, and fine-tune a LLaMA 2 model for you.
- 구글, 웹 브라우저 기반 IDE ‘프로젝트 IDX’ 공개··· AI 코딩 기능 지원
- "AI 기반 빙 & 에지 출시 6개월··· 채팅 10억 건, 이미지 7.5억 개 이상 생성"
- How to fine-tune Llama 2 without writing a single line of code.
- Google PaLM & TPU 개발자들이 MatX 라는 새로운 칩 회사를 설립 (matx.com)
- 5 Surprising Stats About the State of AI in 2023
- Show GN: 프롬프트 엔지니어링으로 수능 국어 1등급에 도전하는 오픈소스 프로젝트 (github.com/NomaDamas)
- 에이수스, 엔비디아 젯슨 오린 플랫폼 기반 소형 AI컴퓨터 출시
- Ollama allows you to run open-source large language models, such as Llama 2, locally.
- Stanford's Natural Language Processing with Deep Learning covers everything from Word2Vec to RLHF!
- Statistics 110: Probability by Harvard University.
- Stable Diffusion WebUI is now on #OpenXLab with #SDXL 🥳 Thanks to #OpenXLab for the A100 GPU! 🔥
- Alibaba, 오픈소스 AI 모델 공개 (cnbc.com)
- 오픈소스 언어 모델의 현재 (twitter.com/Yampeleg)
- Fine-Tune LLaMA 2 with QLoRA
- Stable Diffusion for Audio is here 🤯
- Do Machine Learning Models Memorize or Generalize?
- Recommendation Engine: What It Is, How It Works
- 🎓 ML Courses (11K ⭐️)
- Text Split Explorer
- Text Splitter Playground
- Entity Metadata Extraction
- llama2.c for Dummies (초보자를 위한 llama2.c 가이드) (github.com/RahulSChand)
- Llama 2 Uncensored 버전을 로컬에서 실행하기 (ollama.ai)
- 소프트뱅크, 일본어 특화 AI 모델 만든다··· 오픈AI 경쟁 기관 ‘SB 인튜이션’ 설립
- "AI가 화력발전소 3개 전력 소비" 고성능일수록 전력 소비도 급증
- Supercharging AI/ML Development with JupyterLab and Docker
- 팔란티어 "전례없는 AI 수요 목격"…매출 전망 상향, 주가 2% 상승
- INT8 Quantization for x86 CPU in PyTorch
- The History of Open-Source LLMs: Early Days (Part One)(다른 내용 보기: I’m a researcher with an interest in deep learning and a passion for explaining scientific concepts to others.
- Tensor is a fundamental data structure in Machine Learning. I will clearly explain it today! 🚀
- llama2 8/7/23 Updates
- *Vector databases and why they matter in the LLM and Gen AI world*
- Evaluating LLMs as Agents
- GN⁺: GPTBot - OpenAI의 웹 크롤러 (platform.openai.com)
- vLLM & large models - Using tensor parallelism w/ vLLM & Modal to run Llama 70b
- 생성형 AI 키우는 삼성SDS...AI+클라우드로 기업 AI 시장 잡는다
- 진료 기록 써주고 환자 데이터 요약…빅테크 3사, '의료용 AI' 출사표
- 클라우드 빅3, 생성 AI에 언제 웃을까 - AWS, MS, 구글클라우드 등 분기 실적 비교
- Data engineering failure — Why is it almost impossible to meet deadlines?
- Functionary is a language model that can interpret and execute functions/plugins.
- Routers are modules that take in a user query and a set of “choices” (defined by metadata), and returns one or more selected choices.
- Deploy models painlessly
- Stanford University is offering the Large Language Models course for FREE!(CS324 - Large Language Models)
- Key-Locked Rank One Editing for Text-to-Image Personalization
- Large Language Models Explained - At a High Level
- Large language models, explained with a minimum of math and jargon
- Leveraging Machine Learning for Effective Marketing Strategy Development
- This repository contains demos I made with the Transformers library by 🤗 HuggingFace. Currently, all of them are implemented in PyTorch.
- Segment Anything as a Service
- The first iterations of ControlNet for SDXL are hitting HuggingFace 🧙♂️
- K-nearest Neighbors in Scikit-learn
- NASA-IBM, 기후 변화 연구 위한 LLM 모델 오픈소스로 공개
- 메타, 오픈소스 AI '오디오 크래프트' 출시··· "텍스트 입력만으로 음향·음악 생성"
- Function calling in Llama
- “생성형 AI만 있으면 나도 영화 감독?” 런웨이 젠2 활용한 SF 단편영화 제작기
- 파워 앱스ㆍ파워 오토메이트에서 로우코드 AI 코딩하기
- Nvidia H100 GPUs: Supply and Demand
- Neo4j Graph Store
- Do Multilingual Language Models Think Better in English?
- CoreWeave raises $2.3 billion in debt collateralized by Nvidia chips
- On-disk HNSW index for Postgres with pg_embedding
- Wow, pushing a 7b model to almost 50% accuracy on GSM8k, approaching code-davinci-002, this is significant!! Half a year ago, the best score I could get was only 27% with FlanT5 11B. Science moves really fast.
- PubMedQA - A Dataset for Biomedical Research Question Answering
- Google is about to receive the biggest update in its history. Artificial intelligence will be integrated directly into the search engine.
- AI NPC 생성 기술 인월드AI, 5천만 달러 투자 유치··· 삼성·LG도 참여
- ‘구글 어시스턴트’의 미래는 어떻게 될까?
- 엔비디아 덕분에 신데렐라가 된 'GPU 클라우드' 업체
- 카카오, 2분기 영업이익 34%↓"10월 초거대 AI 모델 공개"
- ◐ GPT-Migrate ◑ - Easily migrate your codebase from one framework or language to another.
- Securing LLM Systems Against Prompt Injection
- Using AI to Build Stronger Connections with Customers
- @arp_ai - One of the best channels for NLP! 🙏
- Revisiting DETR Pre-training for Object Detection
- 🧺 RAGstack - Deploy a private ChatGPT alternative hosted within your VPC.
- Announcing SDXL 1.0 by stability.ai
- 사전 훈련 없이도 작업 척척··· 딥마인드, 로봇 위한 ‘액션’ 모델 RT-2 공개
- IT 리더가 검토해야 하는 생성형 AI 쟁점 20가지
- 맥킨지 “기업 22% 생성형 AI 활용··· 기술, 금융, 제약 분야 등에 도입 증가”
- Introducing LeMUR, the easiest way to build LLM apps on spoken data. Search, summarize, ask questions, and generate new text, with knowledge of all your application’s spoken data.
- 서비스나우, 생성형 AI 기반 사례 요약 및 코드 생성 기능 발표
- 더 강력한 생성형 AI 규제가 필요한 이유 “결국 업체가 아닌 기업이 책임지기 때문”
- Med-Flamingo: a Multimodal Medical Few-shot Learner
- PromptTools - 🔧 Test and experiment with prompts, LLMs, and vector databases. 🔨
- GitHub CEO: AI and software development are now inextricably linked
- FLO가 비슷한 음악을 찾는 방법
- FalconLite is a quantized version of the Falcon 40B SFT OASST-TOP1 model, capable of processing long (i.e. 11K tokens) input sequences while consuming 4x less GPU memory.
- Awesome MLOps Awesome - A curated list of awesome MLOps tools.
- The state of AI in 2023: Generative AI’s breakout year
- Chapyter is a JupyterLab extension that seamlessly connects GPT-4 to your coding environment.
- Patterns for Building LLM-based Systems & Products
- 🔉 LP-MusicCaps: LLM-Based Pseudo Music Captioning(Colab 실행)
- Music To Image - Sends an audio into LP-Music-Caps to generate a audio caption which is then translated to an illustrative image description with Llama2, and finally run through Stable Diffusion XL to generate an image from the audio !
- Researchers Publish Attack Algorithm for ChatGPT and Other LLMs
- 7 Frameworks for Serving LLMs
- Introducing CM3leon, a more efficient, state-of-the-art generative model for text and images
- KoBBQ: Korean Bias Benchmark for Question Answering
- My AI work is available here: Everything I know about AI is in this file.
- The team is working on a Llama 2 variant of Giraffe and plans to release the weights for that one as well.
- LLM Reasoners is a library to enable LLMs to conduct complex reasoning, with advanced reasoning algorithms.
- Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
- datagran - No-code components + a powerful IDE + a smart AI assistant.
- Update on the NN+Gzip front.
- 🚀 Are you interested in learning more about machine learning and its applications? Do you want to watch some awesome YouTube channels that cover topics such as deep learning, natural language processing, neural networks, and more?
- Run Llama 2 on your own Mac using LLM and Homebrew
- Treating Attention Deficit Disorder in LLMs
- Optimizing latency - An exploration of ways to optimize on latency.
- "기업 55%, 새 애플리케이션 개발에 AI 우선 전략 채택" 가트너 AI 설문조사
- calamanCy is a Tagalog natural language preprocessing framework made with spaCy.
- XML Agent
- UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
- Building a center of excellence for data science: Five pillars for success
- Med-Flamingo is a medical vision-language model with multimodal in-context learning abilities. This model is based on the OpenFlamingo-9B V1 model which uses the CLIP ViT-L/14 vision encoder and the Llama-7B language model as frozen backbones.
- LLM-Rec: Personalized Recommendation via Prompting Large Language Models
- 🛠️ToolBench🤖 - 🔨This project (ToolLLM) aims to construct open-source, large-scale, high-quality instruction tuning SFT data to facilitate the construction of powerful LLMs with general tool-use capability.
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
- 미국 패스트푸드 체인 AI 도입 열풍.. 드라이브 스루에 속속 도입
- 🚀LLaMA2-Accessory is an open-source toolkit for pre-training, fine-tuning and deployment of Large Language Models (LLMs) and mutlimodal LLMs.
- 실무에 생성형 AI 어떻게 활용할까..AWS, 무료 및 저비용 교육 과정
- 7 free and low-cost AWS courses that can help you use generative AI
- 음성 비서와 생성AI 통합 급물살...아마존·구글 행보 구체화
- Practical AI for Instructors and Students Part 1: Introduction to AI for Teachers and Students
- llama2.rs - This is a one-file Rust implementation of Llama2.
- Llama 2: Open Foundation and Fine-Tuned Chat Models
- Gorilla: Large Language Model Connected with Massive APIs
- With @llama_index data agents + text-to-image, we can augment prompt w/ relevant context from a knowledge base! 🔎
- Handling big models for inference
- Truss - The simplest way to serve AI/ML models in production
- Python quant code from Goldman Sachs.
- 中 “AI 시장 잡자”…알리바바가 선택한 승부수는?
- 네이버 초대규모 AI, 커머스·모빌리티·금융·교육으로 확산 - 쏘카·SK C&C·한글과컴퓨터 등 하이퍼클로바X 협업 기업 줄이어
- AWS, 생성AI 모델 포트폴리오 늘린다...코히어와도 제휴
- So you want to build your own open source ChatGPT-style chatbot…
- Financial Applications of Machine Learning
- PeerDB is a Postgres-first data-movement platform that makes moving data in and out of Postgres fast and simple. It enables you to sync, transform and query data across your stores using simple SQL commands.
- 'AI 신뢰성 인증제도' 나온다
- Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks
- But what are PyTorch DataLoaders really?
- Here's how to generate your PowerPoint slides using AI (It's 100% free) 👇
- Large Language Models and Nearest Neighbors
- DeepMind just announced Med-PaLM M, a Multimodal Generative AI
- What AI can do with a toolbox... Getting started with Code Interpreter
- 압도적으로 혁신적인가? GPT-4의 비밀-2
- RT-2: New model translates vision and language into action
- Big Data Isn’t Better Data: What’s Wrong with Analytics
- LLM Attacks
- roop for StableDiffusion - This is an extension for StableDiffusion's AUTOMATIC1111 web-ui that allows face-replacement in images.
- Web Explorer - This is a lightweight app using the Web Research Retriever.
- 5 ways to Increase Statistical Power - Statistical Power in A/B testing visualized
- FacTool: Factuality Detection in Generative AI
- Tracking Anything in High Quality
- Berkeley Open-Sources AI Image-Editing Model InstructPix2Pix
- edge-tts is a Python module that allows you to use Microsoft Edge's online text-to-speech service from within your Python code or using the provided edge-tts or edge-playback command.
- ML app with streamlit in super short steps
- SAM ALTMAN SAYS SORRY, AI IS DEFINITELY DESTROYING JOBS
- Vector Store Options & Feature Support
- 7 Ways to Monitor Large Language Model Behavior - Seven ways to track the evolution of LLMs with LangKit and WhyLabs
- LLMs and the Emerging ML Tech Stack
- Hippocratic AI is the new state of the art (SOTA) model, outperforming GPT-4 on 105 of 114 healthcare exams and certifications.
- 하드웨어
- Intel's Downfall Mitigations Drop Performance Up to 39%, Tests Show
- 열흘 동안 질화갈륨 고속 충전기 7개 산 사람의 이유 있는 변명
- GN⁺: Cloudflare 가 제공하는 인터넷 속도 테스트 (speed.cloudflare.com)
- 아마존이 전세계 Arm 서버 CPU의 절반을 보유중 (theregister.com)
- “조립 좀 해 본” PC 애호가를 위한 필수 툴 7가지
- 애플이 새 아이폰용 칩을 위해 수십억달러를 절약하는 방법 (theinformation.com)
- “SSD 병목의 궁극적 해결”⋯‘광자’ PCIe 새 규격 만든다
- [비행소년] 비행기를 추적해 보자 #1
- 프레임워크 랩톱 13 리뷰 | ‘누구나’ 수리, 업그레이드해 오래 쓸 수 있는 노트북
- “메테오 레이크로 ‘AI PC’시대 열 것” 인텔 CEO 팻 겔싱어
- 읽을거리
- 부자 미국·가난한 유럽, 격차 더 커진다
- 이래도 게임 탓, 저래도 게임중독... 미디어 속 게임 수난사
- 가계부 템플릿(2023년)
- 미 연준 역사 한 방에 정리 (WSJ)
- Hubble Space Telescope
- 트위터 "이용자 연평균 소득 5천220만원…타 SNS·동영상보다↑"
- 거주자의 종합소득에 대한 소득세는 해당 연도의 종합소득과세표준에 다음의 세율을 적용하여 계산한 금액(이하 “종합소득산출세액”이라 한다)을 그 세액으로 한다.
- How Well Can You Hear Audio Quality?
- 한달에 4천을 벌어도 인생은 안 바뀌더군요.JPG
- 얼렁뚱땅 잼버리 메타버스, 국민 여러분의 세금이 '터지지 않고' 있습니다
- 나이 들수록 불행한 한국인…"월소득 500만원 넘으면 더 행복"
- 돈이 많다고 행복한 건 아니다
- NASA Plus is the latest streaming competitor
토요일, 8월 12, 2023
[B급 프로그래머] 8월 2주 소식(빅데이터/인공지능, 하드웨어, 읽을거리 부문)
(오늘의 짤방: via @miniapeur)
피드 구독하기:
댓글 (Atom)
댓글 없음:
댓글 쓰기