Architecture of OzSearch

Status Quo of BeCurio Technology
Core Technology
- Search engine index structure and algorithms
- Data structures and algorithms for string management
- Library for improving memory / resource efficiency
- In-memory big data technology (based on search engine technology)


Index Structure of OzSearch
The shared index structure reduces resource usage by more than 50% and
enables memory-based search for big data.

Shared Index Structure
- Small index DB → lower OP load → faster search
- The larger the data size, the more resources are saved

Operation Algorithm and inverted file

Memory data reduction technology
Memory (data type simplification + byte + slimmed data structure) – I/O {usage frequency grouping + data reduction}
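
As an illustration of the "data type simplification" idea only, the following minimal Python sketch compares a posting list stored as ordinary integer objects with the same list stored as fixed-width unsigned integers. The posting list `doc_ids` is hypothetical, and this is not OzSearch's actual memory layout, which the slides do not describe.

```python
import sys
from array import array

# Hypothetical posting list: record IDs for one index term.
doc_ids = list(range(0, 200_000, 2))

# Data type simplification: fixed-width unsigned integers (typically
# 4 bytes each) instead of a list of full Python int objects.
compact = array("I", doc_ids)

# A list stores one pointer per element plus one int object per element.
list_bytes = sys.getsizeof(doc_ids) + sum(sys.getsizeof(i) for i in doc_ids)
compact_bytes = sys.getsizeof(compact)

print(f"list of int objects : {list_bytes:,} bytes")
print(f"array of unsigned   : {compact_bytes:,} bytes")
```

The exact byte counts depend on the platform and the Python build, so none are quoted here; the point is only that a narrower, fixed-width representation holds the same IDs in less memory.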

Memory Reduction: Data Structure and Capacity

Inverted file size & capacity comparison

Comparison test with the same data set
Test Procedure
- Data download
- Full n-gram indexing (not bi-gram), without compression
- Comparison factors
  - Indexing time, index size, search speed
  - Number of searched items
  - Retrieval of the target item
Data source : Wikipedia
Data size : 10,000,000 records (50GB text)
Avg. record length : 400 bytes
Data format : Title
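
The test procedure above can be sketched in miniature as follows. This is a toy stand-in, not the OzSearch or Solr implementation. It assumes "full n-gram" means indexing every contiguous substring of each title, with no compression; the slides do not define the term. The function names (`ngrams`, `build_index`, `search`) and the three sample titles are hypothetical.

```python
import time
from collections import defaultdict

def ngrams(text, max_n=None):
    """Yield every contiguous substring of text with length 1..max_n."""
    limit = len(text) if max_n is None else min(max_n, len(text))
    for n in range(1, limit + 1):
        for i in range(len(text) - n + 1):
            yield text[i:i + n]

def build_index(titles, max_n=None):
    """Inverted file: map each n-gram to the set of record IDs containing it."""
    index = defaultdict(set)
    for rec_id, title in enumerate(titles):
        for gram in ngrams(title.lower(), max_n):
            index[gram].add(rec_id)
    return index

def search(index, query):
    """Return the sorted record IDs whose title contains the query string."""
    return sorted(index.get(query.lower(), ()))

# Toy data standing in for the Wikipedia title set.
titles = ["Inverted index", "Search engine", "Index structure"]

t0 = time.perf_counter()
index = build_index(titles)
indexing_time = time.perf_counter() - t0

t0 = time.perf_counter()
hits = search(index, "index")
search_time = time.perf_counter() - t0

# The comparison factors from the test procedure.
print(f"indexing time : {indexing_time:.6f} s")
print(f"index size    : {len(index)} distinct n-grams")
print(f"search time   : {search_time:.6f} s")
print(f"items found   : {len(hits)} -> {[titles[i] for i in hits]}")
```

Indexing every substring makes lookup a single dictionary access for any query, at the cost of an index that grows quickly with record length, which is why index size appears alongside indexing time and search speed as a comparison factor.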

Comparison Table with Solr
