Toggle navigation
Email or Username
Password
Remember
Login
Register
|
I forgot my password
Questions
Unanswered
Tags
Users
Ask a Question
Fuzzy search?
0
like
0
dislike
13
views
There are two rows, the 1st short 1-3 words, the second-longest 10-20 words, we have to determine is whether the first string into the second or how many percent it is there. Advise algorithms :)
algorithms
text-processing
asked
Mar 21, 2019
by
Zubchick
|
13
views
answer
comment
Please
log in
or
register
to add a comment.
Please
log in
or
register
to answer this question.
7
Answers
0
like
0
dislike
I once wrote a thesis on this; I got that it is best to compare the Russian words along the length of the maximal common prefix (as a percentage of the lesser of the length of words should be above threshold). Comparisons — do you compare words in pairs of rows, and output function of the similarity using the distance between the similar words.
answered
Mar 23, 2019
by
xmoonlight
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
try the Levenshtein distance schitat:
http://ru.wikipedia.org/wiki/%D0%A0%D0%B0%D1%81%D1%81%D1%82%D0%BE%D1%8F%D0%BD%D0%B8%D0%B5_%D0%9B%D0%B5%D0%B2%D0%B5%D0%BD%D1%88%D1%82%D0%B5%D0%B9%D0%BD%D0%B0
example
\r
bytes.com/topic/python/answers/580959-fuzzy-string-comparison
answered
Mar 23, 2019
by
Nicolette
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
if(longStr.Contains(shortStr))
answered
Mar 23, 2019
by
Zubchick
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
There are many different distances between the words, I would break the phrase into words and would have taken the average of the maximum obtained measures of pairwise matching words.
answered
Mar 23, 2019
by
bear11
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
You can compare the method of trigrams. Will give a certain result, even if the words with different endings, etc.
answered
Mar 23, 2019
by
Zubchick
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
Traditionally the Levenshtein distance.
\r
But I would recommend ispolzvat longest common subsequence. Thus, it is possible to introduce some penalty for the gaps between words.
answered
Mar 23, 2019
by
Zubchick
ask related question
comment
Please
log in
or
register
to add a comment.
0
like
0
dislike
How to determine the similarity of two strings?
answered
Mar 23, 2019
by
bear11
ask related question
comment
Please
log in
or
register
to add a comment.
Related questions
0
like
0
dislike
1
answer
How to compare two arrays of text?
asked
May 21, 2019
by
w0rkaholic
algorithms
text-processing
0
like
0
dislike
3
answers
What are patterns or algorithms for text processing?
asked
Apr 9, 2019
by
27cm
html
algorithms
text-processing
0
like
0
dislike
5
answers
The algorithm for comparison of text strings?
asked
Mar 20, 2019
by
Xp0M0u
algorithms
text-processing
0
like
0
dislike
2
answers
How we can implement fuzzy search in string?
asked
Jun 1, 2019
by
Zhandos
algorithms
0
like
0
dislike
2
answers
Search for files containing the Cyrillic alphabet?
asked
Apr 9, 2019
by
AVTOGRAD
text-processing
text-editors
Most popular tags
javascript
php
css
html
jquery
wordpress
python
linux
web-development
mysql
android
windows
java
layout
c#
computer-networks
node.js
cpp
iron
yii
vue.js
1C-Bitrix
react
laravel
django
nginx
system-administration
search-engine-optimization
api
ubuntu
the-it-education.
ajax
sql
programming
hosting
cms
design
apache
google-chrome
bootstrap
Vkontakte
macos
google
network-administration
git
laptops
algorithms
regular-expressions
unity-game-engine
email
angular
database
network-equipment
software
wooсommerce
debian
.net
ios
information-security
video
law-in-it
browsers
books
parsing
wi-fi
game-development
career
htaccess
postgresql
telegram
mikrotik
mobile-development
ruby-on-rails
the-domain-name-system
modx
Yandex
c
json
opencart
Habr
freelance
vpn
asp.net
windows-server
symfony
bots
hard-drives
math
qt
DIY
audio
frontend
payment-system
bash
electronics
gulp.js
user-interface
docker
online-shopping
110,608
questions
257,186
answers
0
comments
32,755
users