'core api' 태그의 글 목록

[elasticsearch] Core API - Search

Elastic/Elasticsearch 2013. 4. 16. 18:43

본 문서는 개인적인 테스트와 elasticsearch.org 그리고 community 등을 참고해서 작성된 것이며,

정보 교환이 목적입니다.

잘못된 부분에 대해서는 지적 부탁 드립니다.

(예시 코드는 성능 및 보안 검증이 되지 않았습니다.)

[elasticsearch API 리뷰]

원문 링크 : http://www.elasticsearch.org/guide/reference/api/search/

[Filter]

- http://www.elasticsearch.org/guide/reference/api/search/filter/

- 실행한 query 결과를 바탕으로 filtering 할 수 있는 옵션

[Rescore]

- http://www.elasticsearch.org/guide/reference/api/search/rescore/

- search_type 이 scan 또는 count 일 경우 실행 되지 않음

- query 와 filter 를 통해서 결과를 return 함.

[Highlight]

- http://www.elasticsearch.org/guide/reference/api/search/highlighting.html

- store : yes 와 term_vector : with_positions_offsets 설정이 되어 있어야 함.

- query 생성 시 아래와 같은 속성으로 설정

highlight

.pre_tags

.post_tags

.tags_schema

.fields

[Highlighted Fragments]

{
    "query" : {...},
    "highlight" : {
        "order" : "score",
        "fields" : {
            "content" : {"fragment_size" : 150, "number_of_fragments" : 3}
        }
    }

}

{
    "query" : {...},
    "highlight" : {
        "fields" : {
            "_all" : {},
            "bio.title" : {"number_of_fragments" : 0}
        }
    }
}

- highlight character size = fragment_size (기본 100)

- highlight size = number_of_fragments (기본 5)

- number_of_fragments=0 이면 highlight 된 전체 내용을 return 함.

- boundary characters 는

. boundary_chars 와 boundary_max_scan 속성을 갖는다.

[Fields]

- http://www.elasticsearch.org/guide/reference/api/search/fields.html

- 검색 결과에 화면에 보여주기 위한 field 를 정의 함

- 기본 store : yes 설정이 되어야 하면 no 라면 _source 값을 load 하게 됨.

[Script Fields]

- http://www.elasticsearch.org/guide/reference/api/search/script-fields/

- http://www.elasticsearch.org/guide/reference/modules/scripting/

- 쿼리 결과에 대한 별도의 연산 작업을 통해 지정한 field 명으로 연산된 값을 return 한다.

{
    "query" : {
        ...
    },
    "script_fields" : {
        "test1" : {
            "script" : "doc['my_field_name'].value * 2"
        },
        "test2" : {
            "script" : "doc['my_field_name'].value * factor",
            "params" : {
                "factor"  : 2.0
            }
        }
    } 
}

Its important to understand the difference between doc['my_field'].value and _source.my_field.

[Preference]

- http://www.elasticsearch.org/guide/reference/api/search/preference/

- 검색 요청 시 shard control 을 통해서 사용

[Term Facet]

- http://www.elasticsearch.org/guide/reference/api/search/facets/terms-facet/

{
    "query" : {
        "match_all" : {  }
    },
    "facets" : {
        "tag" : {
            "terms" : {
                "field" : "tag",
                "size" : 10,
                "order" : "term"
            }
        }
    }
}

- tag field 에서 가장 빈번한 term(size) 10 개, ordering 은 term 으로 수행.

- ordering option 은 count, term, reverse_count, reverse_term (기본 count)

[Range Facet]

- http://www.elasticsearch.org/guide/reference/api/search/facets/range-facet/

- range facet 을 사용하기 위해서는 IndexNumericFieldData(RangeFacetExecutor.java) 를 가져야 함.

- key and value 에서는 key 는 range 비교 항목으로 사용하고, value 는 비교된 결과에 aggregate 연산(total)을 함

- script key and value 사용 가능 함

{
    "query" : {
        "match_all" : {}
    },
    "facets" : {
        "range1" : {
            "range" : {
                "key_script" : "doc['date'].date.minuteOfHour",
                "value_script" : "doc['num1'].value",
                "ranges" : [
                    { "to" : 50 },
                    { "from" : 20, "to" : 70 },
                    { "from" : 70, "to" : 120 },
                    { "from" : 150 }
                ]
            }
        }
    }
}

[Statistical Facet]

Statistical facet allows to compute statistical data on a numeric fields. The statistical data include count, total, sum of squares, mean (average), minimum, maximum, variance, and standard deviation.

- numeric fields 에 대해서 통계적 데이터를 계산함.

- script field 와 multi field 지원 함.

[Term Stats Facet]

- term 과 statistical 조합

- size 속성의 기본 값은 10

- size 가 0 인 경우 전체 matching term 을 return 한다. (주의, 너무 많은 결과가 나올 수 있음)

- ordering 옵션은 아래와 같다.

term, reverse_term, count, reverse_count, total, reverse_total, min, reverse_min, max, reverse_max, mean, reverse_mean. Defaults to count.

:

[elasticsearch] Core API - Get/Multi Get/Update

Elastic/Elasticsearch 2013. 4. 16. 15:22

본 문서는 개인적인 테스트와 elasticsearch.org 그리고 community 등을 참고해서 작성된 것이며,

정보 교환이 목적입니다.

잘못된 부분에 대해서는 지적 부탁 드립니다.

(예시 코드는 성능 및 보안 검증이 되지 않았습니다.)

[elasticsearch 리뷰]

원문 링크

http://www.elasticsearch.org/guide/reference/api/get.html

http://www.elasticsearch.org/guide/reference/api/multi-get.html

http://www.elasticsearch.org/guide/reference/api/update.html

위 3개의 API 들은 Java API 를 참고하는게 더 좋습니다.

기본 Core API 에서 보여 주고 있는 예제는 REST API 로 kimchy 가 추천하는 방식과는 조금 다른 예제로 보여 주기 위한 목적이 강하다는 판단 입니다.

kimchy 의 추천은 Native Java API 를 사용할 것을 추천 합니다.

:

[elasticsearch] Core API - Delete

Elastic/Elasticsearch 2013. 4. 16. 15:03

본 문서는 개인적인 테스트와 elasticsearch.org 그리고 community 등을 참고해서 작성된 것이며,

정보 교환이 목적입니다.

잘못된 부분에 대해서는 지적 부탁 드립니다.

(예시 코드는 성능 및 보안 검증이 되지 않았습니다.)

[elasticsearch 리뷰]

원문 링크 : http://www.elasticsearch.org/guide/reference/api/delete/

Parent

- routing 설정과 동일하게 동작 함

Replication Type

- 기본 sync 로 동작 하면 async 설정 가능 함

- async 설정 시 primary shard 만 적용 되면 성공

:

[elasticsearch] Core API - Index

Elastic/Elasticsearch 2013. 4. 16. 14:56

본 문서는 개인적인 테스트와 elasticsearch.org 그리고 community 등을 참고해서 작성된 것이며,

정보 교환이 목적입니다.

잘못된 부분에 대해서는 지적 부탁 드립니다.

(예시 코드는 성능 및 보안 검증이 되지 않았습니다.)

[elasticsearch 리뷰]

원문 링크 : http://www.elasticsearch.org/guide/reference/api/index_/

Automatic Index Creation

- 자동으로 인덱스를 생성 하거나 field 를 생성 할수 있도록 하는 옵션

Versioning

- 본문에 나와 있듯이 read-then-update transaction 에 좋은 사례로 사용됨

- http://en.wikipedia.org/wiki/Optimistic_concurrency_control

Operation Type

- 강제로 create 하도록 설정 가능 함

Automatic ID Generation

- _id 에 해당 하는 값이 자동으로 생성 됨

Routing

- document 의 id 값을 hashing 해서 shard 에 저장함

Parents & Children

- indexing 시에 명시적으로 parent 를 이용해서 child document 를 색인 할 수 있다.

- _parent 참고

Timestamp

- mapping 설정에서 _tmestamp 참고

TTL

- 문서의 expiration 시간을 설정 함

- mapping 설정에서 _ttl 참고

Distributed

- 기본 primary shard 로 저장을 하는데 routing 설정을 통해서 분산 저장 설정을 함

Write Consistency

- 쓰기 동작에 대한 유효성 설정정

- success 조건은 quorum (>replicas/2+1)

Asynchronous Replication

- 기본 sync 방식으로 동작하나, async 설정이 가능 함.

- async 설정 시 primary shard 에 저장이 되면 성공 임.

Refresh

- 검색과 색인 관점에서 성능에 문제가 되지 않는지 검증 하고 설정 해야 함.

- true 설정 시 realtime 적용됨

Timeout

- 간혹 primary shard 가 응답하지 못할 수가 있기 때문에 timeout 값을 설정 함.

:

jjeong

'core api'에 해당되는 글 4건

[elasticsearch] Core API - Search

[elasticsearch] Core API - Get/Multi Get/Update

[elasticsearch] Core API - Delete

[elasticsearch] Core API - Index

티스토리툴바