Search Patterns
Comments
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
$
%
123
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Still loading...
ABCDEFGHIJKLMNOPQRSTUVWX
1
Pattern familyPatternShould find text Nstring NWorks in test (Y / N)Works in PN (Y / N)DescriptionNotesType (developer use)Developer notes
2
StringA String is defined as anything that is not necessarily a complete word. So, if you search for του with "Substring" selected in PN, you will find τοὺς and αὐτοῦ and αὐτοὺς. If you search for του with "Word/Phrase" selected you will find τοῦ and του (i.e. τινος). Word/Phrase assumes the presence of boundaries at either end of the string, whether you enter them or not; thus, if you search for #του# with "Substring" selected in PN the results should be (almost exactly) the same as those of a search for του with "Word/Phrase" selected.UNLESS NOTED OTHERWISE ALL SEARCHES ARE ASSUMED TO BE CASE- AND DIACRITIC-INSENSITIVE
3
οιν ψισP.Lond. VII 2193 lin4κοινὸν ... ὙψίστουYY substring (ngram)Native Solr support
4
#κοι #υψιP.Lond. VII 2193 lin4κοινὸν ... ὙψίστουYYAny instances of a word beginning with the string 'κoι' and a word beginning with the string 'υψι' occurring together in the same document
5
γγελιας# γους#P.Lond. VII 2193 lin12παραγγελίας ... [σ]υνλ̣όγουςYYAny instances of a word ending with the string 'γγελιας' and a word ending with the string 'γους' occurring together in the same document
6
#προYYAny instances of a word beginning with the string 'προ'
7
του#YYAny instances of a word ending with the string 'του' (including the word 'του')
8
#του#YYAny instances of the string 'του' that are both preceded and followed by a word-boundary (i.e., the word 'του')
9
#παραγγελίας# #συνλόγους#παραγγελίας, [σ]υνλ̣όγους12YYAny instance of the string 'παραγγελίας' occurring together with any instance of the string 'συνλόγους', both demarcated by word boundaries, within the same documentNote that AND is the default operator
10
#παρα λόγους#παραγγελίας, [σ]υνλ̣όγους12YYAny instance of a word beginning with the string 'παρα' and any instance of a word ending with the string ᾽λόγους᾽occurring together in the same document Note that AND is the default operator
11
ουσι# #τωιἀ̣ν̣ήκουσι τῶι11YYAny instance of a word ending with the string 'ουσι' occurring together with a word beginning with the string 'τωι' in the same document
12
εως# #υπαβασ̣ιλ̣έ̣ω̣ς̣· ὑπακούσειν10YYAny instance of a word ending with the string 'εως' occurring together with a word beginning with the string 'υπα' in the same document
13
#προ OR #μητρικYYAny instance of a word beginning with the string 'προ' or a word beginning with the string 'μυτρικ'substring(ngram) with operator
14
#προ OR #μητρικ NOT #βασιλΥYAny instance of a word beginning with the string 'προ' or the string ᾽μητρικ' found in a document that does *not* contain a word beginning with the string 'βασιλ'.New functionality as of 2011.10.03
15
και AFTER συναγωγας AFTER συνλογουςκαι, και 12 and 13NNAny instance of the string 'και' occurring at any point in the document following an instance of the string 'συναγωγας', itself following an instance of the string 'συνλογους'-->THENordered phrase Two strategies possible here: (i) extend QueryParser to use the Lucene SpanNearQuery class (http://lucene.apache.org/java/2_9_0/api/ core/org/apache/lucene/search/spans/SpanNearQuery.html), which is inherently directional (ii) regexes, as below
16
και AFTER συναγωγας BEFORE #αποκαι12NNAny instance of the string 'και' occurring at any point in the document between the first instance of the string 'συναγωγας' and the last instance of a word beginning with the string ᾽απο'-->THENordered substringCreate an untokenized (=KeywordTokenizer) field and search using regexes of the type \bτου\b.*\bανδ
17
#του# ΤΗΕΝ #ανδΝNAny substring 'του' (equivalent here to a word, because delimited at both ends) occuring at any point in the document prior to the last instance of a word beginning 'ανδ'BEFORE, IMMEDIATELY-BEFORE, AFTER, IMMEDIATELY-AFTER keywords all unimplemented
18
και AFTER (συναγωγας BEFORE #απο)και, και12 and 13NNAny instance of the string 'και' occurring at any point in the document after the first instance of the string 'συναγωγας' which itself occurs before a word beginning with 'απο'-->THENordered substring with precedence operatorsProbably no need for precedence operators now that the THEN synax is preferred.
19
του# NOT #του#YYAny instance of a word ending with the string 'του' occurring in a document that does *not* contain the word 'του' (more precisely, that does not contain any instance of the string 'του' both preceded and followed by a word-boundary.Revisit: should find doc that also contains 'του'; just dont want to highlight that stringlookThis could be implemented in two stages: first, performing a term query to get all matching terms (nb: has full java regex support); then a standard solr query for matching these terms
20
σπεν* ευχ*σ̣π̣έν̣δοντ̣ες εὐχέσθωσαν9YYAny instance of the string 'σπεν', optionally followed by any number of characters, occurring together in the same document with 'ευχ', optionally followed by any number of characterswildcard (ngram)Native Solr support
21
Word/PhraseWord/Phrase is defined as anything that is not simply a String (see above). The search engine defaults to Words rather than Strings; this in some of the examples below, you will see that it is necessary to distinguish explicitly between Words and Strings
22
"των ανδρων"τῶν ἀνδρῶν6ΥYAny instance of the word 'των' followed immediately by any instance of the word 'ανδρων'; note that "των ανδρων" + Word/Phrase should generate the same results as "#των# #ανδρων#" + SubstringCheck if the example given corresponds to the general "a b" pattern given on wiki:searchPatterns; if so, functioning in Word/Phrase search, which is presumably as it should be.phraseNative Solr support
23
24
Lemmatized
25
διαγορεύωδιαγ̣[ορ]εύειYYAny form of the verb διαγορεὺω; be sure to select the radio button "Lemmatized"; or, search lem:διαγορεύωlemmatisedCurrently supported through two-stage query process
26
Lemmatized combinationsResults should be identical regardless of whether lemmatized search is indicated using radio buttons or 'lem:' string
27
lem:λόγιος lem:τόποςλόγιον τοῦ 6ΥYAny form of the adjective λόγιος occurring together with any form of the noun 'τόπος' within the same documentNew functionality as of 2011.10.03lemmatised
28
lem:λόγιος lem:τόποςλόγιον τοῦ 6ΥYAny form of the adjective λόγιος occurring together with any form of the noun 'τόπος' within the same documentNew functionality as of 2011.10.03
29
λόγιος τουλόγιον τοῦ 6??see note in next columnsc: with Lemmatized Search selected w/radio buttons; right now this returns a lemmatized search for λογιος and a substring search for του
30
lem:λόγιος AND lem:τόποςλόγιον ... τόπου6YYAny form of the adjective λόγιος occurring together with any form of the noun τόπος in the same documentNew functionality as of 2011.10.03lemmatised with operator
31
lem:λόγιος AND transcription_ngram_ia:(οπο)λόγιον ... τόπου6YYAny form of the adjective λόγιος occurring together with any instance of the string 'οπο' within the same document, case and diacritic insensitiveNew functionality as of 2011.10.03; change transcription_ngram_ia: --> string:lemmatised and substring
32
lem:λόγιος AND τόπου
λόγιον ... τόπου6YYAny form of the adjective λόγιος occurring together with any instance of the word 'τόπου' within the same document, case and diacritic insensitiveBecause lemmatised searching is necessarily case- and diacritic-insensitive, the search for τόπου here is also case- and diacritic-insensitive; New functionality as of 2011.10.03
33
lem:ἀγαθός OR transcription_ngram_ia:(μητρικ)ἀγαθῆι3ΥYAny form of the adjective 'ἀγαθός' or any instance of the string 'μητρικ'New functionality as of 2011.10.03; change transcription_ngram_ia: --> string:lemmatised and substring with operator
34
lem:ἀγαθός OR μητρικηνἀγαθῆι3ΥYAny form of the adjective 'ἀγαθός' or any instance of the word 'μητρικαν'New functionality as of 2011.10.03
35
lem:πατήρ NOT transcription_ngram_ia:(μητρικ)πατὴρ21ΥYAny form of the word 'πατήρ' found in a document that does *not* contain the substring 'μητρικ'New functionality as of 2011.10.03; change transcription_ngram_ia: --> string:
36
lem:πατήρ NOT μητρικηνπατὴρ21ΥYAny form of the word 'πατήρ' found in a document that does *not* contain the word 'μητρικην'.New functionality as of 2011.10.03
37
lem:νομος AFTER lem:αγαθοςνόμος4NNAny form of the noun 'νόμος' occurring at any point in a document after the first instance of the adjective 'ἀγαθός'lemmatised and ordered
38
lem:τυχη IMMEDIATELY-AFTER lem:αγαθοςτύχηι3NNAny form of the noun 'τύχη' immediately preceded by any form of the adjective 'ἀγαθός' --> THEN
39
#τρια WITHIN n words of lem:αγαθοςτριάκοντα2NNAny instance of a word beginning with the string 'τρια' found within n words of any form of the adjective 'ἀγαθός'lemmatised and substring with
40
#τρια BEFORE lem:αγαθοςτριάκοντα2NNAny instance of a word beginning with the string 'τρια' occurring at any point in a document prior to any form of the adjective 'ἀγαθός'lemmatised with substring and ordered
41
#των# IMMEDIATELY-BEFORE lem:ανηρτῶν6NNAny instance of the string (here=word, because delimited by word boundaries) 'των' followed immediately by any form of the noun 'ἀνὴρ'--> THEN
42
#τυχ IMMEDIATELY-AFTER lem:αγαθοςτύχηι3NNAny word beginning with the string 'τυχ' immediately preceded by any form of the adjective 'ἀγαθός' --> THEN
43
μος# AFTER lem:αγαθοςνόμος4NNAny word ending with the string 'μος' occurring at any point in a document after the first instance of the adjective 'ἀγαθός'
44
lem:αυτος AFTER #επι# AFTER μηδενιαὐτῶν13NNAny form of the pronoun αὐτός occurring at any point after the string (here=word) 'επι', itself follows the first instance in the document of the string μηδενι.
45
lem:αυτος AFTER #επι# BEFORE μηδενιαὐτοῖς12NNAny form of the pronoun αὐτός occurring at any point after the string (here=word) 'επι', itself precedes the last instance in the document of the string μηδενι.--> THEN
46
(lem:αυτος AFTER #επι#) BEFORE μηδενιαὐτῶν 13NNAny form of the pronoun αὐτός occurring at any point in the document after the string (here=word) 'επι', but before the last instance of the string μηδενι.--> THENlemmatised with substring and ordered and with precedence operators
47
Double hitsThe point of these tests is that they match variant forms and supplied forms
48
σχισμσχί<σ>ματα13ΝΝAny variant of the substring 'σχισμ'cross-tagThe difficulty here is of <reg><orig> variants. It might be possible to get around this by using the SpellCheck component, but this raises questions of consistency: as the range of variability widens, so do the number of false positives (from the user's perspective). Possible need to make this user-configurable, so that either exact or expanded search is possible - with only reg forms being indexed into the transcription fields, but reg and orig forms both being indexed into a dictionary field? A couple of questions present themselves: how close are reg and orig strings likely to be? How many distinct reg/orig pairs are there? If the resulting dictionary field is relatively small, then accuracy will be high; if otherwise, otherwise ...
49
σχιμσχί<σ>ματα13ΝΝAny variant of the substring 'σχιμ'
50
πετεσουΠετε{ετε}σοῦ[χον6ΝΝAny variant of the substring 'πετεσου'
51
πετεετΠετε{ετε}σοῦ[χον6Ν.Α.N.A.Any variant of the substring 'πετεετ'The string-test given is not in fact found in the document
52
ηνα πoσμῆνα πόσι[ν]8NN.A.cross-tag in apparatus
53
ηινα πoσμηινα πόσι[ν]8NN.A.Match in apparatus
54
Wildcards
55
η?ουμενονἡγούμενον6YYAny substring consisting of the 'η' character, followed by any single character (including whitespace characters), followed by the string 'ουμενον'WildcardNative Solr support
56
η??υμενονἡγούμενον6YYAny substring consisting of the 'η' character, followed by any two characters (incl. whitespace characters), followed by the string 'υμενον'
57
η???μενονἡγούμενον6YYAny substring consisting of the 'η' character, followed by any three characters (incl. whitespace characters), followed by the string 'μενον'
58
η*ενονἡγούμενον6YYAny substring consisting of the 'η' character, followed by any number of any characters (incl. whitespace characters), followed by the string 'ενον'
59
60
συναγωγὴ WITHIN n words of αποσυναγωγὰς καὶ ἀποδημί̣[ας], συναγωγὰς ... ἀπ[ο]χωρήσε[ιν12 and 12 - 14ΥNAny instance of the word 'συναγωγὴ' occurring within n words of the word 'ἀπό' Proximity phraseNative Solr support
61
συναγωγὴ WITHIN n words of #αποσυναγωγὰς καὶ ἀποδημί̣[ας], συναγωγὰς ... ἀπ[ο]χωρήσε[ιν12 and 12 - 14ΝNAny instance of the word 'συναγωγὴ' occurring within n words of a word beginning with the string 'απο'Proximity substringNative Solr support is for phrase proximity searching only. Possibly these would need to be handled by regexes of the type: συναγωγη(.῝\b){n}\bαπο?
62
συναγωγὴ WITHIN n words of μιας#συναγωγὰς καὶ ἀποδημί̣[ας]12ΝNAny instance of the word 'συναγωγὴ' occurring within n words of a word ending with the string 'μιας'
63
συναγωγὴ WITHIN n words of #μηδε#συναγωγὰς ... μηδὲ 12 and 13ΝNAny instance of the word 'συναγωγὴ' occurring within n words of the substring (here,=word) μηδε
64
συναγωγὴ WITHIN (4, 8) words of #μηδεσυναγωγὰς ... μηδενὶ, συναγωγὰς ... μηδὲ12 and 13ΝNAny instance of the word 'συναγωγὴ' occurring within 4 - 8 words of a word beginning with the substring 'μηδε'
65
#μηδ NOT WITHIN 3 words of #απομηδὲ13ΝNAny word beginning with the substring 'μηδ' that does *not* occur within three words of a word beginning with the substring 'απο'.Negated proximity substring
66
#μηδ NOT WITHIN 4 words of lem:αυτοςμηδʼ14ΝNAny word beginning with the substring 'μηδ' that does *not* occur within four words of any form of the pronoun 'αὐτὀς᾽Negated proximity lemma and substring
67
βa THEN λ WITHIN 4 chars THEN υσ WITHIN 4
Any string of characters conforming to the pattern 'βα', followed by the 'λ' character within the four characters subsequent to the 'α' character, followed by the character sequence 'υσ' within four characters (or, possibly, words) of the 'λ' character.The unit following the final digit is not specified; it might be either words or (more likely) characters.Ordered substring (with characters rather than words as the unit)
68
Metadata combinationsA user-end query syntax needs to be worked out for this.In particular, how do users specify that they wish to search BOTH APIS *and* HGV metadata? Is the 'OR' syntax sufficient?
69
translation:gymnasiarchYYAny document, the translation of which contains the word 'gymnasiarch'Metadata phrase
70
meta:sheepYYAny document, the metadata of which contains the word 'sheep' (though see the section-heading note on the syntax for expressing this)This should perform a metadata search across *both* HGV and APIS collections.
71
APIS:witnesses AND #μαρτυρsee notesee noteAny document for which the APIS metadata contains the word 'witnesses' and the contents contain a word beginning with the substring 'μαρτυρ'.This *does* work using the extended syntax APIS:witnesses string:#μαρτυρMetadata phrase and substring
72
APIS:slave AND #προγεγραμμsee notesee noteAny document containing a word beginning with the substring #προγεγραμμ, the APIS metadata of which contains the word 'slave'.Works using the extended syntax, as above
73
HGV:Zeus AND #προγεγραμμπρο[γ]εγ[ρ]αμμένου7see notesee noteAny document containing a word beginning with the substring #προγεγραμμ, the HGV metadata of which contains the word 'Zeus'.Works using the extended syntax, as above
74
(APIS: Ägypten OR HGV: Ägypten) AND #προγεγραμμsee notesee noteAny document containing a word beginning with the substring #προγεγραμμ, the APIS or HGV metadata of which contains the word 'slave'.Works using the extended syntax, as aboveMetadata phrase with substring, with operators
75
APIS: Ägypten OR (HGV: Ägypten AND #προγεγραμμ)see notesee noteAny document for which the APIS metadata contains the word 'Ägypten', or for which both the HGV metadata contains the word 'Ägypten' and the contents contain a word beginning with the substring ' προγεγραμμ'.Works using the extended syntax, as above
76
(HGV:Ägypten NOT APIS:receipt) AND #και#see notesee noteAny document, the HGV metadata of which contains the word 'Ägypten', but for which the APIS metadata does *not* contain the word 'receipt', and the contents of which contain the substring (here=word) 'και'.Works with the syntax (HGV:Egypt NOT APIS:receipt) string:#και#
77
HGV:Ägypten NOT (APIS:receipt AND #και#)see notesee noteAny document, the HGV metadata of which contains the word 'Ägypten', but for which the APIS metadata does *not* contain the word 'receipt', nor do the contents contain the substring (here=word) 'και'.Works using syntax HGV:Egypt NOT(APIS:receipt string:#και#)
78
Other
79
μοσονεθεννόμος ὃν ἔθεντο (P.Lond. VII 2193 line 4)4NNAny string of letters 'μοσονεθεν' regardless of spacing or punctuationCross-token
80
81
82
OLD PATTERNS
83
των IMMEDIATELY-BEFORE #ανδτῶν ἀνδρῶν6NNAny word ending with the string 'των' and immediately preceding a word beginning with the string 'ανδ' That is to say, only whitespace or punctuation characters can separate them.
84
#ανδ AFTER #του#του, του4 and 6ΝNAny word beginning with the string 'ανδ' and occurring at any point in the document after the first instance of the substring (here = word) 'του'
85
#ανδ IMMEDIATELY-AFTER τωντῶν ἀνδρῶν6ΝNAny word ending with the string 'των' and immediately preceding a word beginning with the string 'ανδ'. That is to say, only whitespace or punctuation characters separate them.
86
και AFTER συνλογουςκαι, και, και12 and 13NNAny instance of the string 'και' appearing at any point in the document after the first instance of the string 'συνλογους'.
87
lem:λόγιος AND οποNN
88
lem:αγαθος OR μητρικ
89
lem:πατήρ ΝΟΤ μητρικ
90
91
92
93
94
95
96
97
98
99
100
Loading...
 
 
 
TimsCrib
SearchDocumentation
Tidied Search Patterns