New Ideas
Geek Ideas” is “Lexsense” own creative tab. The tab is used to share -with our followers- creative ideas we are looking to develop in the future. Contributions are highly welcomed. Anyone who would like to contribute, will have their name mentioned on our contribution page:
Geek idea 1:
Automated Article Content Summarization & Topic Identification System
Overview: The idea here is to create an NLP-based tool that reads a published article and automatically identifies the key topics, themes, and a succinct summary of what the article is about based on its content, title, keywords or tags. It could handle academic papers, news articles, or even blog posts. To achieve this, we might use the following python features: Topic Modeling, Text Summarization, and Named Entity Recognition (NER). The system would be able to extract key topics and provide a concise summary of the article to give readers a quick understanding of the article’s core subject.
Why it’s a geeky idea:
Advanced NLP Techniques: You’ll be working with techniques like Topic Modeling (e.g., LDA or BERTopic), Text Summarization (e.g., extractive or abstractive methods), and NER to automatically extract the essence of a text.
Real-World Application: This could be super useful for academic researchers, news aggregator, or even social media platforms where users often want a quick idea of what an article is about before deciding whether to engage with it.
Complexity: The system will need to handle nuances in language, extract contextual meaning, and be able to distinguish between various levels of detail (e.g., identifying the difference between an academic article, news piece, or opinion article).
- Data Innovation Summit – Hybrid, Stockholm
- Data Innovation Summit MEA – Adnec Centre, Abu Dhabi
Geek idea 2:
Smart Image Annotation & Retrieval System
A platform where users can upload images, annotate them with text and voice, and retrieve them using search or voice commands.
Tech Stack & Features
Image Annotation
Text Annotations
Store images in Firebase Storage / AWS S3.
- Store metadata (title, tags, descriptions) in a database (PostgreSQL, MongoDB, or Firebase Firestore).
- Use a form in the UI for users to manually add text annotations.✅ Voice Annotations
- Record voice in the UI (use Web Audio API for browsers or React Native Audio Recorder for mobile).
- Convert speech to text using OpenAI Whisper API or Google Speech-to-Text API.
- Store both the original voice file and the converted text annotation.
AI Translation & Accessibility
Automatic Translation
Use Google Translate API / DeepL API / OpenAI API to translate text annotations.
Store translations in the database alongside the original text.✅ Text-to-Speech (TTS)
Convert translated text into speech using Google TTS API, ElevenLabs, or OpenAI.
Store and let users play the translated voice output.✅ OCR Integration (Text Extraction from Images)
Use Tesseract OCR (open-source) or Google Vision API to extract text from images.
Automatically translate extracted text using AI translation APIs.
3. Smart Search & Retrieval✅ Keyword-Based Search
Store tags, descriptions, and metadata in a structured way for search.
Use Elasticsearch, PostgreSQL full-text search, or Firebase Firestore search.✅ Voice Search
Users speak a query → Convert to text with Google Speech-to-Text API → Match with stored image descriptions.✅ Image Recognition (Auto-tagging & Reverse Search)
Use Google Vision API or OpenAI CLIP to auto-label images.
Enable reverse image search (find similar images based on content).Dagstuhl-Seminar – Natural Language Processing for Mental Health 31 Aug 2025 – 05 Sep 2025 • Schloss Dagstuhl – Wadern, Germany
User Interface & Deployment
Web & Mobile App Development
- Frontend: React.js (for web) or React Native (for mobile).
- Backend: FastAPI (Python) or Express.js (Node.js).
✅ Cloud Storage & Hosting
Database: PostgreSQL (Supabase), MongoDB (Atlas), or Firebase Firestore.
Store images in AWS S3 / Firebase Storage.
Host the backend on Firebase Functions, AWS Lambda, or Vercel.
Submit A New Idea to Develop
Submitting a new ideas for development on Lexsense.net is open to all. All submissions are reviewed by a developer before their addition. We reserve the right to revise the data or decline the publication of a particular event. Once a project idea has been added to the portal, the original developer’s name will be published and shared online. Contributers can join online to particpate in the project.
To add an entry for an event related to NLP and Data Science, email the information listed below to editors[@]lexsense.net:
- Project name;
- The goal to reach.
- Environment;
- Github URL
Our Peer to Peer Reviewer
Some Project Ideas
Project Title:
“Emotion-Aware Chatbot for Mental Health Support”
Project Title:
“Multi-Lingual Hate Speech Detection Using Transformer-Based Models”
Prominent Reviewrs in Linguistics & NLP
“Seriously magic.”
Expert in linguistic typology, NLP, and language modelling. Editor & frequent reviewer in Computational Linguistics.
-Chakir Mahjoubi
“My favorite tool!
Known for work in computational semantics and CCG grammar. Longstanding contributor to linguistic theory in NLP.“
–Mark Steedman
“Game-changer.”
Researches semantics, representation learning, and neural NLP. Active reviewer for ACL, NeurIPS, and TACL..”
–Ellie Pavlick