![]() ![]() I use OpenAI’s CLIP to perform a semantic search. The background images are pulled from two sources, the Wikimedia Commons and the OpenImages dataset. In the age of the Internet, the term meme has been narrowed to mean a piece of content, typically an image with a funny caption, that’s spread online via social media. The Wiktionary defines the word meme as “any unit of cultural information, such as a practice or idea, that is transmitted verbally or by repeated action from one mind to another in a comparable way to the transmission of genes.” The term originated in Richard Dawkins’ book, The Selfish Gene. Meme by AI-Memer, Image by Atsuko Sato, Caption by OpenAI GPT-3, License: CC BY-SA 4.0 What are memes, again? The user selects the best caption to create the new meme, which can be downloaded. Either the GPT-3 model from OpenAI or the GPT-Neo model from EleutherAI is used to generate 10 possible captions. The user checks out the top 10 images that match the query and selects their favorite. I then perform a semantic search on the images. A semantic search looks for matching concepts, not just a word search. I use the CLIP encoders from OpenAI to first perform a semantic search on the text descriptions. ![]() Both datasets have corresponding text descriptions of the images. The system then checks for matching images in Wikimedia Commons and the OpenImages dataset. The user starts by entering a search query to find a background image, like “apple pie”. ![]() AI-Memer Components, Diagram by Author, pie photo by W.carter ![]()
0 Comments
Leave a Reply. |