The Frequent Words Problem: Difference between revisions

From Algorithm Wiki
(One intermediate revision by the same user not shown)
Line 6: Line 6:
== Parameters ==  
== Parameters ==  


n: length of string
$n$: length of string


k: length of words
$k$: length of words


sigma: size of alphabet
$\sigma$: size of alphabet


== Table of Algorithms ==  
== Table of Algorithms ==  
Line 20: Line 20:
|-
|-


| [[Naive solution (The Frequent Words Problem The Frequent Words Problem)|Naive solution]] || 1940 || $O(n)$ || $O(max(n, sigma^k)$) auxiliary || Exact || Deterministic ||   
| [[Naive solution (The Frequent Words Problem The Frequent Words Problem)|Naive solution]] || 1940 || $O(n)$ || $O(max(n, \sigma^k)$) || Exact || Deterministic ||   
|-
|-
| [[Rabin Karp (The Frequent Words Problem The Frequent Words Problem)|Rabin Karp]] || 1987 || $O(n)$ || $O(max(n, sigma^k)$) auxiliary? || Exact || Deterministic ||   
| [[Rabin Karp (The Frequent Words Problem The Frequent Words Problem)|Rabin Karp]] || 1987 || $O(n)$ || $O(max(n, \sigma^k)$)? || Exact || Deterministic ||   
|-
|-
|}
|}
Line 29: Line 29:


[[File:The Frequent Words Problem - Time.png|1000px]]
[[File:The Frequent Words Problem - Time.png|1000px]]
== Space Complexity Graph ==
[[File:The Frequent Words Problem - Space.png|1000px]]
== Time-Space Tradeoff ==
[[File:The Frequent Words Problem - Pareto Frontier.png|1000px]]


== References/Citation ==  
== References/Citation ==  


http://bioinformaticsalgorithms.com/data/debugdatasets/replication/FrequentWordsProblem.pdf
http://bioinformaticsalgorithms.com/data/debugdatasets/replication/FrequentWordsProblem.pdf

Latest revision as of 09:12, 28 April 2023

Description

Given a string of length $n$ and an input integer $k$, determine the most frequent $k$-mers in the string, i.e. the words of length $k$ that occur most often.
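As an illustration of the problem statement, here is a minimal sketch of a direct counting approach: slide a window of length $k$ over the string, count each $k$-mer in a hash map, and report every $k$-mer that reaches the maximum count. The function name `most_frequent_kmers` and the choice to return the result sorted are assumptions made for this example, not something the page prescribes. Each window is copied as a slice, so this sketch does $O(k)$ work per window and is not meant to demonstrate the bounds listed in the table.

```python
from collections import Counter


def most_frequent_kmers(text: str, k: int) -> list[str]:
    """Return all k-mers with the highest occurrence count, sorted.

    Hypothetical helper for illustration; returns [] when no k-mer exists.
    """
    if k <= 0 or k > len(text):
        return []
    # There are n - k + 1 windows of length k in a string of length n.
    counts = Counter(text[i:i + k] for i in range(len(text) - k + 1))
    best = max(counts.values())
    # Several k-mers may tie for the maximum, so return all of them.
    return sorted(kmer for kmer, count in counts.items() if count == best)


if __name__ == "__main__":
    print(most_frequent_kmers("ACGTTGCATGTCGCATGATGCATGAGAGCT", 4))
```

Overlapping occurrences are counted, so for example `most_frequent_kmers("AAAA", 2)` counts `AA` three times.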

Parameters

$n$: length of string

$k$: length of words

$\sigma$: size of alphabet

Table of Algorithms

{|
! Name !! Year !! Time !! Space !! Approximation Factor !! Model !! Reference
|-
| Naive solution || 1940 || $O(n)$ || $O(\max(n, \sigma^k))$ || Exact || Deterministic ||
|-
| Rabin Karp || 1987 || $O(n)$ || $O(\max(n, \sigma^k))$? || Exact || Deterministic ||
|}
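The $\sigma^k$ term in the space column suggests a table with one counter per possible $k$-mer. The sketch below shows one way such a table could be filled in a single pass: each window is encoded as a base-$\sigma$ integer, and the code is updated in constant time when the window slides, in the style of a Rabin-Karp rolling hash. This is an assumption about how the listed entries might work, not a transcription of them. No modulus is taken, so the code is an exact index below $\sigma^k$ rather than a true hash. The function name and the default alphabet `"ACGT"` are hypothetical.

```python
def most_frequent_kmers_rolling(text: str, k: int, alphabet: str = "ACGT") -> list[str]:
    """Count k-mers in a direct-address table indexed by a rolling base-sigma code.

    Illustrative sketch: assumes every character of `text` is in `alphabet`
    (a KeyError is raised otherwise) and that sigma**k counters fit in memory.
    """
    sigma = len(alphabet)
    n = len(text)
    if k <= 0 or k > n:
        return []
    rank = {ch: i for i, ch in enumerate(alphabet)}
    counts = [0] * (sigma ** k)   # one counter per possible k-mer
    high = sigma ** (k - 1)       # weight of the leftmost character in the code

    # Encode the first window as a base-sigma number.
    code = 0
    for ch in text[:k]:
        code = code * sigma + rank[ch]
    counts[code] += 1

    # Slide the window: drop the leftmost digit, shift, append the new digit.
    for i in range(k, n):
        code = (code - rank[text[i - k]] * high) * sigma + rank[text[i]]
        counts[code] += 1

    best = max(counts)
    result = []
    for c, count in enumerate(counts):
        if count == best:
            # Decode the integer back into its k-mer, least significant digit first.
            digits = []
            for _ in range(k):
                digits.append(alphabet[c % sigma])
                c //= sigma
            result.append("".join(reversed(digits)))
    return result


if __name__ == "__main__":
    print(most_frequent_kmers_rolling("ACGTTGCATGTCGCATGATGCATGAGAGCT", 4))
```

The table trades memory for constant-time updates: it allocates $\sigma^k$ counters regardless of how many distinct $k$-mers actually occur, so it is only practical when $\sigma^k$ is small.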

Time Complexity Graph

The Frequent Words Problem - Time.png

References/Citation

http://bioinformaticsalgorithms.com/data/debugdatasets/replication/FrequentWordsProblem.pdf