Algorithms, Data Structure, System Design Problems

Saturday, 26 April 2025

Vector Database

Vector databases are specialized databases designed to store, manage, and search high-dimensional vectors—often used in machine learning, artificial intelligence, and especially in applications like:

Semantic search
Recommendation systems
Image, video, or audio similarity
Natural language processing (e.g., embeddings from models like BERT or OpenAI's models)

🧠 What is a "vector" in this context?

A vector is basically a list of numbers that represents data in a numerical format. For instance, a sentence can be turned into a vector using an embedding model, like OpenAI’s embedding models or word2vec.

Example vector:

[0.21, -0.53, 0.88, ..., 0.05]

These vectors are often hundreds or thousands of dimensions long.

⚙️ How vector databases work

They use Approximate Nearest Neighbor (ANN) algorithms to find similar vectors quickly. This is key when you're doing things like:

"Find the most similar document to this one"
"Which image looks closest to this?"

Popular ANN algorithms:

HNSW (Hierarchical Navigable Small World)
IVF (Inverted File Index)
PQ (Product Quantization)

🔥 Popular Vector Databases

Pinecone – Fully managed, scalable, simple to use
Weaviate – Open-source with built-in ML features
Milvus – High-performance and scalable, also open-source
FAISS (by Meta) – Library for similarity search (not a full database, but often used with others)
Qdrant – Open-source, supports filtering and metadata
Chroma – Lightweight and often used for LLM apps

Sunday, 2 March 2025

Multithreading and Concurrency details:

Multithreading and Concurrency is one of the most important topic for Senior Engineering Interviews. I mostly work with these two languages, so here are some great articles/repos to help master concurrency and its nuinances.

𝐉𝐚𝐯𝐚 🚀
1. Concurrency Series By Baeldung: https://lnkd.in/gBtMG6Mk

2. Proving Concurrency Requirement: https://lnkd.in/gwvvBWxY

3. Concurrency Series by Jenkov: https://lnkd.in/gaWZUj3W

𝐆𝐨 🚀

1. Utlimate Concurreny Guide on Github: https://lnkd.in/gRU-Z-WS
2. Concurrency Patterns: https://lnkd.in/g6nq6ZRP

If you want to build real systems like Redis, Kafka, DNS and a SqlLite Database yourself checkout these best of a kind tutorials.

1. Build Your Own Redis: https://lnkd.in/gz92ygFH
2. Build Your Own Kafka: https://lnkd.in/gm58s8CX
3. Build Your Own Http Server: https://lnkd.in/g_VxvcUN
4. Build Your Own DNS Server: https://lnkd.in/g7iM2F69

Plus amazing tutorials for all languages like Go, Rust, Java and Python.

Bonus:
Lock Free Algorithms:
1. https://lnkd.in/gTwk6Cpk
2. https://lnkd.in/gYVSVska

Wednesday, 26 February 2025

System design cheatsheet and resources

• 𝗦𝘆𝘀𝘁𝗲𝗺 𝗗𝗲𝘀𝗶𝗴𝗻 𝗙𝘂𝗻𝗱𝗮𝗺𝗲𝗻𝘁𝗮𝗹𝘀:

Important things