[scala] filter, exists 성능 주의

scala 2016. 9. 29. 19:19

검색하다보니 filter와 exists 컬렉션 API에 대한 성능 주의 사항이 좀 있다.

그선.. 스크랩을 해둔다.

1. filter

https://www.sumologic.com/blog-technology/3-tips-for-writing-performant-scala/

Using lazy collections must be taken with a grain of salt — while lazy collections often can improve performance, they can also make it worse. For example:

def nonview = (1 to 5000000).map(_ % 10).filter(_ > 5).reduce(_ + _)
def view = (1 to 5000000).view.map(_ % 10).filter(_ > 5).reduce(_ + _)

view raw gistfile1.scala hosted with ❤ by GitHub

For this microbenchmark, the lazy version ran 1.5x faster than the strict version. However, for smaller values of n, the strict version will run faster. Lazy evaluation requires the creation of an additional closure. If creating the closures takes longer than creating intermediate collections, the lazy version will run slower. Profile and understand your bottlenecks before optimizing!

filter가 새로운 콜렉션을 내부적으로 생성하기 때문에

view.filter를 사용하면 lazy 코드로 1.5배 빠르다고 한다.

또한, filter는 컬렉션을 모두 순회하는 linear time의 오퍼레이션이다.

2. exists

http://stackoverflow.com/questions/16443177/scala-which-data-structures-are-optimal-in-which-siutations-when-using-contai

With exists, you really just care about how fast the collection is to traverse--you have to traverse everything anyway. There, List is usually the champ (unless you want to traverse an array by hand), but only Set and so on are usually particularly bad (e.g. exists on List is ~8x faster than on a Set when each have 1000 elements). The others are within about 2.5x of List(usually 1.5x, but Vector has an underlying tree structure which is not all that fast to traverse).

exist는 컬렉션을 모두 순회하는 선형 시간의 오퍼레이션이다. 그런데, List.exists가 Set.exists보다 8배 이상 빠르다고 한다. 다른 컬렉션보다 Lists.exists가 1.5배에서 2.5배 빠르다고 한다.

저작자표시

'scala' 카테고리의 다른 글

스칼라의 대수적 자료형(Algebraic Data Types, ADT), exhaustivity-checking (0)	2016.10.04
[scala] 이름에 의한 호출 매개변수 (by-name parameter) (0)	2016.09.29
[scala] Odering/Ordered (sorted, sortBy, sortWith), TreeMap (0)	2016.09.29
[scala] @ 연산자 (0)	2016.09.28
[scala] vararg, _* 타입 어노테이션 (0)	2016.09.28

Posted by '김용환'

일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

[scala] filter, exists 성능 주의

'scala' 카테고리의 다른 글

카테고리

태그목록

최근에 올라온 글

최근에 달린 댓글

최근에 받은 트랙백

글 보관함

달력

링크

티스토리툴바

	def nonview = (1 to 5000000).map(_ % 10).filter(_ > 5).reduce(_ + _)
	def view = (1 to 5000000).view.map(_ % 10).filter(_ > 5).reduce(_ + _)