Posts in the 'All categories' category (Page 83)

[cassandra3] Backing up and restoring a schema

cassandra 2017. 8. 8. 19:33

To dump the schema of every keyspace, run the following.

$ ./bin/cqlsh -e "desc schema"

CREATE KEYSPACE users WITH replication = {'class': 'SimpleStrategy', 'replication_factor': '1'} AND durable_writes = true;

CREATE TABLE users.follow_relation (

...

)

To save the dump to a file, redirect the output.

$ ./bin/cqlsh -e "desc schema" > schema.cql

To save only a specific keyspace to a file, run the following.

$ ./bin/cqlsh -e "desc keyspace my_status" > my_status.cql

$ cat my_status.cql

CREATE KEYSPACE my_status WITH replication = {'class': 'SimpleStrategy', 'replication_factor': '1'} AND durable_writes = true;

CREATE TABLE my_status.follow_relation (

followed_username text,

follower_username text,

....

)

To import the generated keyspace file, start cqlsh and use the source command.

$ ./bin/cqlsh

Connected to Test Cluster at 127.0.0.1:9042.

[cqlsh 5.0.1 | Cassandra 3.10 | CQL spec 3.4.4 | Native protocol v4]

Use HELP for help.

cqlsh> source 'schema.cql'

cqlsh> use my_status;

cqlsh:my_status> describe my_status;

Posted by '김용환'

[spark] [Repost] wide dependency, narrow dependency

scala 2017. 8. 8. 18:37

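I came to regret writing Spark code carelessly, without thinking it through. I remembered many places where I had only hoped the job would somehow run.

A Coursera course on Spark explains wide and narrow dependencies. It gave me a lot to think about, so I am reposting the summary here.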

https://github.com/rohitvg/scala-spark-4/wiki/Wide-vs-Narrow-Dependencies

Transformations with (usually) Narrow dependencies:

map
mapValues
flatMap
filter
mapPartitions
mapPartitionsWithIndex

Transformations with (usually) Wide dependencies: (might cause a shuffle)

cogroup
groupWith
join
leftOuterJoin
rightOuterJoin
groupByKey
reduceByKey
combineByKey
distinct
intersection
repartition
coalesce
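To make the two lists above more concrete, here is a minimal sketch in plain Java with no Spark dependency. It is a toy model rather than Spark's implementation: the class DependencySketch, its sample data, and the modulo-based placement of keys are all assumptions made for illustration. mapValues builds each output partition from exactly one input partition (narrow), while groupByKey has to gather records for a key from every input partition (wide, the step where a shuffle would happen).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model, not Spark: a dataset is a list of partitions,
// and each partition is a list of {key, value} pairs.
final class DependencySketch {

    // Hypothetical sample data: two input partitions that share key 1.
    static List<List<int[]>> sample() {
        List<List<int[]>> partitions = new ArrayList<>();
        partitions.add(Arrays.asList(new int[] {1, 10}, new int[] {2, 20}));
        partitions.add(Arrays.asList(new int[] {1, 30}, new int[] {3, 40}));
        return partitions;
    }

    // Narrow dependency: output partition i is computed from input partition i only.
    static List<List<int[]>> mapValues(List<List<int[]>> partitions, int factor) {
        List<List<int[]>> out = new ArrayList<>();
        for (List<int[]> partition : partitions) {
            List<int[]> mapped = new ArrayList<>();
            for (int[] kv : partition) {
                mapped.add(new int[] {kv[0], kv[1] * factor});
            }
            out.add(mapped);
        }
        return out;
    }

    // Wide dependency: an output partition may need records from every input
    // partition, so records are first routed by key (the "shuffle").
    static List<Map<Integer, List<Integer>>> groupByKey(List<List<int[]>> partitions,
                                                        int numOutputPartitions) {
        List<Map<Integer, List<Integer>>> out = new ArrayList<>();
        for (int i = 0; i < numOutputPartitions; i++) {
            out.add(new TreeMap<>());
        }
        for (List<int[]> partition : partitions) {
            for (int[] kv : partition) {
                int target = Math.floorMod(kv[0], numOutputPartitions);
                out.get(target).computeIfAbsent(kv[0], k -> new ArrayList<>()).add(kv[1]);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<List<int[]>> input = sample();
        System.out.println(mapValues(input, 2).size()); // still 2 partitions, no data moved
        System.out.println(groupByKey(input, 2));       // key 1 collects values from both inputs
    }
}
```

In this model the values for key 1 end up together only because groupByKey reads both input partitions, which is why such transformations are the ones that might cause a shuffle.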

[cassandra3] Resolving "Cannot page queries with both ORDER BY and a IN restriction on the partition key; you must either remove the ORDER BY or the IN and sort client side, or disable paging for this query"

cassandra 2017. 8. 8. 15:29

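In Cassandra, using IN together with ORDER BY can produce the error below.

(For reference, ORDER BY takes a clustering key, so you can get results sorted as you want, for example by creation time in descending order, regardless of the partition key.)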

InvalidRequest: Error from server: code=2200 [Invalid query] message="Cannot page queries with both ORDER BY and a IN restriction on the partition key; you must either remove the ORDER BY or the IN and sort client side, or disable paging for this query"

In this case, running the PAGING OFF command in cqlsh makes the error go away and the query works normally.
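The error message also names another way out: drop the ORDER BY and sort on the client. A minimal sketch of that idea in plain Java follows. The Row class and its fields (partitionKey, createdAtMillis) are hypothetical stand-ins for whatever the query returns; no Cassandra driver is involved.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

final class ClientSideSortSketch {

    // Hypothetical row shape: a partition key plus a clustering column
    // that holds the creation time.
    static final class Row {
        final String partitionKey;
        final long createdAtMillis;

        Row(String partitionKey, long createdAtMillis) {
            this.partitionKey = partitionKey;
            this.createdAtMillis = createdAtMillis;
        }
    }

    // Sorts rows that were fetched with IN but without ORDER BY,
    // newest first, leaving the input list untouched.
    static List<Row> sortNewestFirst(List<Row> rows) {
        List<Row> sorted = new ArrayList<>(rows);
        sorted.sort(Comparator.comparingLong((Row r) -> r.createdAtMillis).reversed());
        return sorted;
    }

    public static void main(String[] args) {
        List<Row> rows = Arrays.asList(
                new Row("alice", 1000L),
                new Row("bob", 3000L),
                new Row("alice", 2000L));
        for (Row row : sortNewestFirst(rows)) {
            System.out.println(row.partitionKey + " " + row.createdAtMillis);
        }
    }
}
```

Sorting on the client means holding the whole result set in memory, so this only suits result sets of modest size.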

[spark2] partitionBy, HashPartitioner, RangePartitioner examples

scala 2017. 8. 7. 17:59

You can choose a Partitioner when calling the partitionBy method on a pair RDD.

The built-in Partitioner (https://spark.apache.org/docs/2.1.0/api/java/org/apache/spark/Partitioner.html) implementations are HashPartitioner and RangePartitioner.

HashPartitioner comes first. It is useful because it spreads keys across partitions by their hash.

The example below creates an RDD with 5 partitions and then repartitions it into 3 partitions with a HashPartitioner.

scala> val pairs = sc.parallelize(List((1, 1), (2, 2), (3, 3)), 5)

pairs: org.apache.spark.rdd.RDD[(Int, Int)] = ParallelCollectionRDD[1] at parallelize at <console>:24

scala> pairs.partitioner

res1: Option[org.apache.spark.Partitioner] = None

scala> import org.apache.spark.HashPartitioner

import org.apache.spark.HashPartitioner

scala> val partitioned = pairs.partitionBy(new HashPartitioner(3)).persist()

partitioned: org.apache.spark.rdd.RDD[(Int, Int)] = ShuffledRDD[3] at partitionBy at <console>:27

scala> partitioned.collect

res2: Array[(Int, Int)] = Array((2,2), (1,1), (3,3))

scala> pairs.partitions.length

res7: Int = 5

scala> partitioned.partitions.length

res8: Int = 3

scala> pairs.partitions

res5: Array[org.apache.spark.Partition] = Array(org.apache.spark.rdd.ParallelCollectionPartition@6ba, org.apache.spark.rdd.ParallelCollectionPartition@6bb, org.apache.spark.rdd.ParallelCollectionPartition@6bc, org.apache.spark.rdd.ParallelCollectionPartition@6bd, org.apache.spark.rdd.ParallelCollectionPartition@6be)

scala> partitioned.partitions

res6: Array[org.apache.spark.Partition] = Array(org.apache.spark.rdd.ShuffledRDDPartition@0, org.apache.spark.rdd.ShuffledRDDPartition@1, org.apache.spark.rdd.ShuffledRDDPartition@2)
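How a hash partitioner picks a partition can be sketched in plain Java without Spark. As far as I understand it, HashPartitioner sends a null key to partition 0 and otherwise takes the key's hashCode modulo the number of partitions, kept non-negative. The class and method names below (HashPartitionSketch, partitionFor) are hypothetical, and the code is an illustration of the idea, not Spark's source.

```java
final class HashPartitionSketch {

    // Returns the partition index for a key: null goes to partition 0,
    // everything else to hashCode modulo numPartitions (never negative).
    static int partitionFor(Object key, int numPartitions) {
        if (key == null) {
            return 0;
        }
        return Math.floorMod(key.hashCode(), numPartitions);
    }

    public static void main(String[] args) {
        // The keys 1, 2, 3 from the example above, spread over 3 partitions.
        for (int key = 1; key <= 3; key++) {
            System.out.println("key " + key + " -> partition " + partitionFor(key, 3));
        }
    }
}
```

Because an Integer's hashCode is its own value, the keys 1, 2 and 3 land in partitions 1, 2 and 0 respectively in this sketch.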

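Calling persist() after partitionBy keeps the shuffled result, so later actions reuse it instead of shuffling again, which helps performance. This is a handy tip in practice.

For reference, the RDD.toDebugString() method helps you tell whether an RDD is a shuffled RDD.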

scala> partitioned.toDebugString

res11: String =

(3) ShuffledRDD[8] at partitionBy at <console>:27 [Memory Deserialized 1x Replicated]

| CachedPartitions: 3; MemorySize: 192.0 B; ExternalBlockStoreSize: 0.0 B; DiskSize: 0.0 B

+-(5) ParallelCollectionRDD[7] at parallelize at <console>:24 [Memory Deserialized 1x Replicated]

scala> pairs.toDebugString

res13: String = (5) ParallelCollectionRDD[7] at parallelize at <console>:24 []

Next is a RangePartitioner example. It looks much the same.

scala> import org.apache.spark.RangePartitioner

import org.apache.spark.RangePartitioner

scala> new RangePartitioner(3, pairs)

res9: org.apache.spark.RangePartitioner[Int,Int] = org.apache.spark.RangePartitioner@7d2d

scala> val rangePartitioned = pairs.partitionBy(new RangePartitioner(3, pairs)).persist()

rangePartitioned: org.apache.spark.rdd.RDD[(Int, Int)] = ShuffledRDD[8] at partitionBy at <console>:28

scala> rangePartitioned.collect

res10: Array[(Int, Int)] = Array((1,1), (2,2), (3,3))

scala> rangePartitioned.partitions.length

res11: Int = 3

Looking at the RangePartitioner API (https://spark.apache.org/docs/2.1.0/api/java/org/apache/spark/RangePartitioner.html), there is a constructor that takes an Ordering and a sort direction (ascending or descending). This seems to be the main difference from HashPartitioner.

Source: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/Partitioner.scala

public RangePartitioner(int partitions,
                RDD<? extends scala.Product2<K,V>> rdd,
                boolean ascending,
                scala.math.Ordering<K> evidence$1,
                scala.reflect.ClassTag<K> evidence$2)
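The effect of the ascending flag can be sketched in plain Java. This is a simplified model under stated assumptions: the upper bounds are passed in by hand, whereas Spark derives its range bounds from the RDD itself, and the names RangePartitionSketch and partitionFor are hypothetical.

```java
final class RangePartitionSketch {

    // upperBounds must be sorted ascending; n bounds split the keys into n + 1 ranges.
    // A key goes to the first range whose upper bound is not smaller than the key.
    // With ascending == false the partition order is simply mirrored.
    static int partitionFor(int key, int[] upperBounds, boolean ascending) {
        int partition = 0;
        while (partition < upperBounds.length && key > upperBounds[partition]) {
            partition++;
        }
        return ascending ? partition : upperBounds.length - partition;
    }

    public static void main(String[] args) {
        int[] bounds = {1, 2}; // hypothetical bounds giving 3 partitions
        for (int key = 1; key <= 3; key++) {
            System.out.println("key " + key
                    + " ascending -> " + partitionFor(key, bounds, true)
                    + ", descending -> " + partitionFor(key, bounds, false));
        }
    }
}
```

Unlike hashing, neighbouring keys stay in the same or adjacent partitions here, which is what makes a range partitioner a natural fit for sorted output.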

[elasticsearch] The indices.fielddata.cache.expire setting

Elasticsearch 2017. 8. 2. 20:21

Elasticsearch 1.x had an option for expiring the field data cache (indices.fielddata.cache.expire), but it was removed as of 2.0.

https://www.elastic.co/guide/en/elasticsearch/reference/1.4/index-modules-fielddata.html

indices.fielddata.cache.expire

[experimental] This functionality is experimental and may be changed or removed completely in a future release. Elastic will take a best effort approach to fix any issues, but experimental features are not subject to the support SLA of official GA features. A time based setting that expires field data after a certain time of inactivity. Defaults to -1. For example, can be set to 5m for a 5 minute expiry.

It seems to have been removed because it triggered a lot of GC and could cause crashes.

https://discuss.elastic.co/t/indices-fielddata-cache-expire/1183

I used it in 1.4 without any problems, but given that it was eventually removed, it presumably caused serious GC issues.

It is gone as of 2.0 anyway, so I am leaving this note for the record.

[elasticsearch1.x] Memory structure (repost)

Elasticsearch 2017. 8. 2. 17:51

This is the memory structure of Elasticsearch 1.x. I am reposting an image that explains it really well.

https://kupczynski.info/2015/04/06/fielddata.html

A larger view of the image is as follows.

[spark2] The difference between cache() and persist()

scala 2017. 8. 1. 16:56

Spark offers two APIs, cache() and persist(), for keeping computed data at a given storage level.

First, an example of calling cache() on an RDD.

scala> val c = sc.parallelize(List("samuel"), 2)

c: org.apache.spark.rdd.RDD[String] = ParallelCollectionRDD[0] at parallelize at <console>:24

scala> c.getStorageLevel

res0: org.apache.spark.storage.StorageLevel = StorageLevel(1 replicas)

scala> c.cache

res1: c.type = ParallelCollectionRDD[0] at parallelize at <console>:24

scala> c.getStorageLevel

res2: org.apache.spark.storage.StorageLevel = StorageLevel(memory, deserialized, 1 replicas)

cache() always uses the default storage level, MEMORY_ONLY.

Next comes persist(), which can be used with any of the storage levels below.

Here SER means serialized. Data stored on disk goes to local disk.

* Storage levels at a glance

Level	Space used	cpu time	In memory	On disk	Serialized
MEMORY_ONLY	High	Low	Y	N	N
MEMORY_ONLY_SER	Low	High	Y	N	Y
MEMORY_AND_DISK	High	Medium	Some	Some	Some
MEMORY_AND_DISK_SER	Low	High	Some	Some	Y
DISK_ONLY	Low	High	N	Y	Y

scala> import org.apache.spark.storage.StorageLevel;

import org.apache.spark.storage.StorageLevel

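// c here appears to be a freshly created RDD (note the id [1] below), since an RDD that is already cached cannot be switched to another level.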

scala> c.persist(StorageLevel.MEMORY_ONLY_SER)

res4: c.type = ParallelCollectionRDD[1] at parallelize at <console>:24

scala> c.getStorageLevel

res5: org.apache.spark.storage.StorageLevel = StorageLevel(memory, 1 replicas)

scala> val c = sc.parallelize(List("samuel"), 2)

c: org.apache.spark.rdd.RDD[String] = ParallelCollectionRDD[2] at parallelize at <console>:25

scala> c.persist(StorageLevel.MEMORY_AND_DISK)

res7: c.type = ParallelCollectionRDD[2] at parallelize at <console>:25

scala> c.getStorageLevel

res8: org.apache.spark.storage.StorageLevel = StorageLevel(disk, memory, deserialized, 1 replicas)

Note that once an RDD has been assigned a storage level, calling persist() on it again with a different level raises an error.

scala> c.persist(StorageLevel.MEMORY_AND_DISK)

java.lang.UnsupportedOperationException: Cannot change storage level of an RDD after it was already assigned a level

at org.apache.spark.rdd.RDD.persist(RDD.scala:169)

at org.apache.spark.rdd.RDD.persist(RDD.scala:194)

... 48 elided
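The rule behind this error can be modelled in a few lines of plain Java. This is a toy model and not Spark code: ToyRdd and the Level enum are hypothetical, and the guard reflects my understanding that the check rejects only a change to a different level once one is set, while unpersist() resets the level.

```java
final class StorageLevelSketch {

    enum Level { NONE, MEMORY_ONLY, MEMORY_ONLY_SER, MEMORY_AND_DISK, MEMORY_AND_DISK_SER, DISK_ONLY }

    // Hypothetical stand-in for an RDD that only tracks its storage level.
    static final class ToyRdd {
        private Level level = Level.NONE;

        ToyRdd persist(Level newLevel) {
            // Assumed rule: a level that is already set cannot be changed to another one.
            if (level != Level.NONE && newLevel != level) {
                throw new UnsupportedOperationException(
                        "Cannot change storage level of an RDD after it was already assigned a level");
            }
            level = newLevel;
            return this;
        }

        // cache() is persist() with the default level.
        ToyRdd cache() {
            return persist(Level.MEMORY_ONLY);
        }

        // Clears the level so that a new one can be assigned afterwards.
        ToyRdd unpersist() {
            level = Level.NONE;
            return this;
        }

        Level getStorageLevel() {
            return level;
        }
    }

    public static void main(String[] args) {
        ToyRdd rdd = new ToyRdd().cache();
        System.out.println(rdd.getStorageLevel());
        try {
            rdd.persist(Level.MEMORY_AND_DISK);
        } catch (UnsupportedOperationException e) {
            System.out.println("rejected: " + e.getMessage());
        }
        System.out.println(rdd.unpersist().persist(Level.MEMORY_AND_DISK).getStorageLevel());
    }
}
```

In this model the way to switch levels is to call unpersist() first and then persist() with the new level.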

The official documentation describes the storage levels as follows.

https://spark.apache.org/docs/latest/rdd-programming-guide.html

Storage Level | Meaning

--------------------------------

MEMORY_ONLY | Store RDD as deserialized Java objects in the JVM. If the RDD does not fit in memory, some partitions will not be cached and will be recomputed on the fly each time they're needed. This is the default level.

MEMORY_AND_DISK | Store RDD as deserialized Java objects in the JVM. If the RDD does not fit in memory, store the partitions that don't fit on disk, and read them from there when they're needed.

MEMORY_ONLY_SER | Store RDD as serialized Java objects (one byte array per partition). This is generally more space-efficient than deserialized objects, especially when using a fast serializer, but more CPU-intensive to read.

MEMORY_AND_DISK_SER | Similar to MEMORY_ONLY_SER, but spill partitions that don't fit in memory to disk instead of recomputing them on the fly each time they're needed.

DISK_ONLY | Store the RDD partitions only on disk.

MEMORY_ONLY_2, MEMORY_AND_DISK_2, etc. | Same as the levels above, but replicate each partition on two cluster nodes.

OFF_HEAP (experimental) | Similar to MEMORY_ONLY_SER, but store the data in off-heap memory. This requires off-heap memory to be enabled.

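One developer has written up the performance difference between storing the raw (deserialized) data and storing it serialized.

Source: http://sujee.net/wp-content/uploads/2015/01/spark-caching-1.png

Also, misusing persist() can reportedly leave memory in use even after the application has finished.

Source: http://tomining.tistory.com/84

So within the application, call unpersist() on every RDD on which persist() was called.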

scala> c.unpersist()

res10: c.type = ParallelCollectionRDD[2] at parallelize at <console>:25

[elasticsearch5] Hot threads API

Elasticsearch 2017. 7. 31. 19:08

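The hot threads API returns its information as formatted text. In other words, the response is not JSON.

Before describing the response structure itself, here is a short outline of the logic that produces the hot threads API response.

Elasticsearch first takes all running threads and collects various information about each one: the CPU time it consumed, the number of times it was blocked or waiting, how long it was blocked or waiting, and so on.

It then waits for a given period (set by the interval parameter) and collects the same information again.

Once that is done, the threads are sorted by how long each one was running, in descending order, so that the longest-running threads come first.

(The time mentioned above is measured according to the operation type given by the type parameter.)

Next, Elasticsearch analyzes the first N threads (N is the thread count given by the threads parameter).

Every few milliseconds, Elasticsearch takes a snapshot of the stack trace of the threads selected in the previous step (the number of snapshots is set by the snapshots parameter).

The last step is to group the stack traces, so that changes in thread state can be visualized, and to return the response to the caller.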
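The first few steps above (measure, wait, measure again, sort, keep the top N) can be sketched with the JDK's ThreadMXBean. This is not Elasticsearch's code: the class HotThreadSamplerSketch and its methods are hypothetical, it only covers the cpu type, and it skips the stack trace snapshots and grouping.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class HotThreadSamplerSketch {

    // CPU time a thread consumed during the sampling interval.
    static final class Sample {
        final long threadId;
        final long cpuNanos;

        Sample(long threadId, long cpuNanos) {
            this.threadId = threadId;
            this.cpuNanos = cpuNanos;
        }
    }

    // Records the accumulated CPU time of every live thread.
    static Map<Long, Long> snapshotCpuTimes(ThreadMXBean bean) {
        Map<Long, Long> times = new HashMap<>();
        for (long id : bean.getAllThreadIds()) {
            long cpu = bean.getThreadCpuTime(id); // -1 if the thread is gone or timing is off
            if (cpu >= 0) {
                times.put(id, cpu);
            }
        }
        return times;
    }

    // Measures twice, intervalMillis apart, and returns the topN threads by CPU time used.
    static List<Sample> busiestThreads(int topN, long intervalMillis) {
        ThreadMXBean bean = ManagementFactory.getThreadMXBean();
        if (!bean.isThreadCpuTimeSupported() || !bean.isThreadCpuTimeEnabled()) {
            return new ArrayList<>();
        }
        Map<Long, Long> before = snapshotCpuTimes(bean);
        try {
            Thread.sleep(intervalMillis);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        Map<Long, Long> after = snapshotCpuTimes(bean);

        List<Sample> deltas = new ArrayList<>();
        for (Map.Entry<Long, Long> entry : after.entrySet()) {
            Long start = before.get(entry.getKey());
            if (start != null) { // ignore threads that started during the interval
                deltas.add(new Sample(entry.getKey(), entry.getValue() - start));
            }
        }
        deltas.sort((a, b) -> Long.compare(b.cpuNanos, a.cpuNanos)); // busiest first
        return new ArrayList<>(deltas.subList(0, Math.min(topN, deltas.size())));
    }

    public static void main(String[] args) {
        for (Sample s : busiestThreads(3, 500)) {
            System.out.println("thread " + s.threadId + " used " + (s.cpuNanos / 1_000_000.0) + " ms of CPU");
        }
    }
}
```

The two arguments in main correspond to the threads and interval parameters of the API.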

By default, threads is 3, interval is 500ms, and type is cpu.

Here is a simple example.

$ curl 'localhost:9200/_nodes/hot_threads?type=cpu&interval=1s'

::: {5OEGj_a}{5OEGj_avT8un0nOak28qQg}{DAzM0ktKQNS047ggd9nYZQ}{127.0.0.1}{127.0.0.1:9300}

Hot threads at 2017-07-31T11:04:59.943Z, interval=1s, busiestThreads=3, ignoreIdleThreads=true:

8.4% (35.1ms out of 1000ms) cpu usage by thread 'elasticsearch[kemi][search][T#2]'

10/10 snapshots sharing following 8 elements

sun.misc.Unsafe.park(Native Method)

java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)

java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)

java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)

java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)

java.util.concurrent.CountDownLatch.await(CountDownLatch.java:231)

org.elasticsearch.bootstrap.Bootstrap$1.run(Bootstrap.java:84)

java.lang.Thread.run(Thread.java:745)

....

Looking at the first part of the result:

It shows which node the hot threads information comes from, which is handy when the hot threads API call is sent to many nodes.

The second part is:

8.4% (35.1ms out of 1000ms) cpu usage by thread 'elasticsearch[kemi][search][T#2]'

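This thread accounted for 8.4% of all CPU time during the measurement.

The cpu usage part indicates that the type in use is cpu (the other values to expect here are block usage for threads in the blocked state and wait usage for threads in the waiting state). The thread name is very important here.

From the thread name we can tell which Elasticsearch thread is the hottest. In this example the hot threads are all search threads (the search value).

Other values you may see include recovery_stream (recovery module events), cache (cache events), merge (segment merging), and index (data indexing threads).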
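Since the response is plain text, pulling out the percentage, the usage type, and the thread pool name takes a little parsing. Below is a minimal sketch with regular expressions, written against the single sample line shown above. The class HotThreadLineSketch is hypothetical, and the assumed thread name layout elasticsearch[node][pool][T#n] is inferred from that sample only.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class HotThreadLineSketch {

    // Matches lines such as:
    // 8.4% (35.1ms out of 1000ms) cpu usage by thread 'elasticsearch[kemi][search][T#2]'
    private static final Pattern LINE = Pattern.compile(
            "^\\s*([0-9]+(?:\\.[0-9]+)?)% \\(.*\\) (cpu|wait|block) usage by thread '(.*)'\\s*$");

    // Assumed thread name layout: elasticsearch[node][pool][T#n]
    private static final Pattern POOL = Pattern.compile(
            "^elasticsearch\\[[^\\]]*\\]\\[([^\\]]*)\\]");

    static final class Parsed {
        final double percent;
        final String type;
        final String threadName;
        final String pool; // null when the thread name does not follow the assumed layout

        Parsed(double percent, String type, String threadName, String pool) {
            this.percent = percent;
            this.type = type;
            this.threadName = threadName;
            this.pool = pool;
        }
    }

    // Returns the parsed fields, or null if the line is not a usage line.
    static Parsed parse(String line) {
        Matcher m = LINE.matcher(line);
        if (!m.matches()) {
            return null;
        }
        Matcher pool = POOL.matcher(m.group(3));
        return new Parsed(Double.parseDouble(m.group(1)), m.group(2), m.group(3),
                pool.find() ? pool.group(1) : null);
    }

    public static void main(String[] args) {
        Parsed p = parse("8.4% (35.1ms out of 1000ms) cpu usage by thread 'elasticsearch[kemi][search][T#2]'");
        System.out.println(p.percent + " " + p.type + " " + p.pool);
    }
}
```

Stack trace lines and the node header do not match the pattern, so parse returns null for them and a caller can simply skip those lines.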

For details, see the following code.

https://github.com/elastic/elasticsearch/blob/v5.2.1/core/src/main/java/org/elasticsearch/action/admin/cluster/node/hotthreads/NodesHotThreadsRequest.java

import java.io.IOException;
import java.util.concurrent.TimeUnit;

import org.elasticsearch.action.support.nodes.BaseNodesRequest;
import org.elasticsearch.common.io.stream.StreamInput;
import org.elasticsearch.common.io.stream.StreamOutput;
import org.elasticsearch.common.unit.TimeValue;

public class NodesHotThreadsRequest extends BaseNodesRequest<NodesHotThreadsRequest> {

    // Defaults: 3 threads, type cpu, 500ms interval, 10 snapshots, idle threads ignored
    int threads = 3;
    String type = "cpu";
    TimeValue interval = new TimeValue(500, TimeUnit.MILLISECONDS);
    int snapshots = 10;
    boolean ignoreIdleThreads = true;

    // for serialization
    public NodesHotThreadsRequest() {
    }

    /**
     * Get hot threads from nodes based on the nodes ids specified. If none are passed, hot
     * threads for all nodes is used.
     */
    public NodesHotThreadsRequest(String... nodesIds) {
        super(nodesIds);
    }

    public int threads() {
        return this.threads;
    }

    public NodesHotThreadsRequest threads(int threads) {
        this.threads = threads;
        return this;
    }

    public boolean ignoreIdleThreads() {
        return this.ignoreIdleThreads;
    }

    public NodesHotThreadsRequest ignoreIdleThreads(boolean ignoreIdleThreads) {
        this.ignoreIdleThreads = ignoreIdleThreads;
        return this;
    }

    public NodesHotThreadsRequest type(String type) {
        this.type = type;
        return this;
    }

    public String type() {
        return this.type;
    }

    public NodesHotThreadsRequest interval(TimeValue interval) {
        this.interval = interval;
        return this;
    }

    public TimeValue interval() {
        return this.interval;
    }

    public int snapshots() {
        return this.snapshots;
    }

    public NodesHotThreadsRequest snapshots(int snapshots) {
        this.snapshots = snapshots;
        return this;
    }

    // Fields are read in the same order in which writeTo writes them.
    @Override
    public void readFrom(StreamInput in) throws IOException {
        super.readFrom(in);
        threads = in.readInt();
        ignoreIdleThreads = in.readBoolean();
        type = in.readString();
        interval = new TimeValue(in);
        snapshots = in.readInt();
    }

    @Override
    public void writeTo(StreamOutput out) throws IOException {
        super.writeTo(out);
        out.writeInt(threads);
        out.writeBoolean(ignoreIdleThreads);
        out.writeString(type);
        interval.writeTo(out);
        out.writeInt(snapshots);
    }
}
