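What's the safest way to iterate through the keys of a Perl hash?
If I have a Perl hash with a bunch of (key, value) pairs, what is the preferred method of iterating through all the keys? I have heard that using each may in some way have unintended side effects. So, is that true, and is one of the two following methods best, or is there a better way?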
# Method 1
while (my ($key, $value) = each(%hash)) {
    # Something
}
# Method 2
foreach my $key (keys(%hash)) {
    # Something
}
The rule of thumb is to use the function most suited to your needs.
If you just want the keys and do not plan to ever read any of the values, use keys():
foreach my $key (keys %hash) { ... }
If you just want the values, use values():
foreach my $val (values %hash) { ... }
If you need the keys and the values, use each():
keys %hash; # reset the internal iterator so a prior each() doesn't affect the loop
while(my($k, $v) = each %hash) { ... }
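If you plan to change the keys of the hash in any way, except for deleting the current key during the iteration, then you must not use each(). For example, this code, which creates a new set of uppercase keys with doubled values, works fine using keys():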
%h = (a => 1, b => 2);
foreach my $k (keys %h)
{
    $h{uc $k} = $h{$k} * 2;
}
producing the expected resulting hash:
(a => 1, A => 2, b => 2, B => 4)
But using each() to do the same thing:
%h = (a => 1, b => 2);
keys %h;
while(my($k, $v) = each %h)
{
    $h{uc $k} = $h{$k} * 2; # BAD IDEA!
}
produces incorrect results in hard-to-predict ways. For example:
(a => 1, A => 2, b => 2, B => 8)
This, however, is safe:
keys %h;
while(my($k, $v) = each %h)
{
    if(...)
    {
        delete $h{$k}; # This is safe
    }
}
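As a runnable illustration of the safe case, here is a minimal sketch. The hash contents and the "odd value" condition are made up for the example and are not from the original answer.

```perl
use strict;
use warnings;

# Hypothetical data, for illustration only.
my %h = (a => 1, b => 2, c => 3, d => 4);

keys %h;    # reset the internal iterator before using each()
while (my ($k, $v) = each %h) {
    # Deleting the key most recently returned by each() is the one
    # modification that is safe during the iteration.
    delete $h{$k} if $v % 2;
}

print join(",", sort keys %h), "\n";    # prints "b,d"
```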
All of this is described in the perl documentation:
% perldoc -f keys
% perldoc -f each
One thing you should be aware of when using each is that it has the side effect of adding "state" to your hash (the hash has to remember what the "next" key is). When using code like the snippets posted above, which iterate over the whole hash in one go, this is usually not a problem. However, you will run into hard-to-track-down problems (I speak from experience ;) when using each together with statements like last or return to exit from the while ... each loop before you have processed all keys.
In this case, the hash will remember which keys it has already returned, and when you use each on it the next time (maybe in a totally unrelated piece of code), it will continue at this position.
Example:
my %hash = ( foo => 1, bar => 2, baz => 3, quux => 4 );
# find key 'baz'
while ( my ($k, $v) = each %hash ) {
    print "found key $k\n";
    last if $k eq 'baz'; # found it!
}
# later ...
print "the hash contains:\n";
# iterate over all keys:
while ( my ($k, $v) = each %hash ) {
    print "$k => $v\n";
}
This prints:
found key bar
found key baz
the hash contains:
quux => 4
foo => 1
What happened to keys "bar" and "baz"? They're still there, but the second each starts where the first one left off, and stops when it reaches the end of the hash, so we never see them in the second loop.
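One way to avoid leaving this state behind is to do the early-exit search with keys instead of each. The following is a minimal sketch of that idea, reusing the same sample hash; the $seen counter is only there to show the result.

```perl
use strict;
use warnings;

my %hash = ( foo => 1, bar => 2, baz => 3, quux => 4 );

# Search with keys: the key list is built up front, so leaving the
# loop early does not leave the hash's iterator part-way through.
foreach my $k (keys %hash) {
    last if $k eq 'baz';
}

# A later each loop therefore starts from the beginning.
my $seen = 0;
while ( my ($k, $v) = each %hash ) {
    $seen++;
}
print "saw $seen of ", scalar(keys %hash), " entries\n";    # saw 4 of 4 entries
```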
The place where each can cause you problems is that it's a true, non-scoped iterator. By way of example:
while ( my ($key,$val) = each %a_hash ) {
    print "$key => $val\n";
    last if $val; #exits loop when $val is true
}
# but "each" hasn't reset!!
while ( my ($key,$val) = each %a_hash ) {
    # continues where the last loop left off
    print "$key => $val\n";
}
If you need to be sure that each gets all the keys and values, you need to make sure you use keys or values first (as that resets the iterator). See the documentation for each.
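A minimal sketch of that reset, assuming a small made-up %a_hash: calling keys in void context between the two loops puts the iterator back at the start, so the second loop sees every entry.

```perl
use strict;
use warnings;

# Hypothetical data, for illustration only.
my %a_hash = ( foo => 1, bar => 2, baz => 3 );

# Leave the first loop early, after a single pair.
while ( my ($key, $val) = each %a_hash ) {
    last;
}

keys %a_hash;    # void-context keys resets the iterator

my @seen;
while ( my ($key, $val) = each %a_hash ) {
    push @seen, $key;
}
print scalar(@seen), "\n";    # prints 3
```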
Using the each syntax will prevent the entire set of keys from being generated at once. This can be important if you're using a hash tied to a database with millions of rows. You don't want to generate the entire list of keys all at once and exhaust your physical memory. In this case each serves as an iterator, whereas keys actually generates the entire list before the loop starts.
So the only place each is of real use is when the hash is very large compared to the available memory. That is only likely to happen when the hash does not live in memory itself, unless you're programming a handheld data collection device or something else with little memory.
If memory is not an issue, the map or keys paradigm is usually the more prevalent and easier to read.
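The difference can be made visible with a tied hash that counts how often Perl asks it for a key. This is only a sketch: CountingHash and $key_requests are hypothetical names invented here, and the class sits on top of the core Tie::StdHash rather than a real database.

```perl
use strict;
use warnings;

{
    # Hypothetical tied class that counts key requests from Perl.
    package CountingHash;
    use Tie::Hash;                            # provides Tie::StdHash
    use parent -norequire, 'Tie::StdHash';

    our $key_requests = 0;

    sub FIRSTKEY {
        my ($self) = @_;
        $key_requests++;
        my $k = $self->SUPER::FIRSTKEY();
        return $k;
    }

    sub NEXTKEY {
        my ($self, $last) = @_;
        $key_requests++;
        my $k = $self->SUPER::NEXTKEY($last);
        return $k;
    }
}

tie my %big, 'CountingHash';
$big{$_} = $_ * 2 for 1 .. 1000;

# each(): stop after the first pair.
$CountingHash::key_requests = 0;
while (my ($k, $v) = each %big) { last }
my $with_each = $CountingHash::key_requests;

# keys(): the whole key list is built before the loop body runs.
$CountingHash::key_requests = 0;
foreach my $k (keys %big) { last }
my $with_keys = $CountingHash::key_requests;

print "each: $with_each key request(s), keys: $with_keys\n";
```

With a tied backend that is expensive to walk, the each loop only pays for the entries it actually visits, while keys pays for all of them before the first iteration.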
A few miscellaneous thoughts on this topic:
- There is nothing unsafe about any of the hash iterators themselves. What is unsafe is modifying the keys of a hash while you're iterating over it. (It's perfectly safe to modify the values.) The only potential side effect I can think of is that values returns aliases, which means that modifying them will modify the contents of the hash. This is by design but may not be what you want in some circumstances.
- John's accepted answer is good with one exception: the documentation is clear that it is not safe to add keys while iterating over a hash. It may work for some data sets but will fail for others depending on the hash order.
- As already noted, it is safe to delete the last key returned by each. This is not true for keys, as each is an iterator while keys returns a list.
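The aliasing behaviour of values mentioned in the first point can be seen in a short sketch (the sample hash is made up for the example):

```perl
use strict;
use warnings;

my %h = (a => 1, b => 2);

# Each element returned by values is an alias to the stored value,
# so modifying $_ here changes the hash itself.
$_ *= 2 for values %h;

print join(", ", map { "$_ => $h{$_}" } sort keys %h), "\n";    # a => 2, b => 4
```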
I always use method 2 as well. The only benefit of using each is if you're just reading (rather than re-assigning) the value of the hash entry, you're not constantly de-referencing the hash.
I may get bitten by this one, but I think that it's personal preference. I can't find any reference in the docs to each() being different than keys() or values() (other than the obvious "they return different things" answer). In fact the docs state that they use the same iterator and that they all return actual list values instead of copies of them, and that modifying the hash while iterating over it using any call is bad.
All that said, I almost always use keys() because to me it is usually more self documenting to access the key's value via the hash itself. I occasionally use values() when the value is a reference to a large structure and the key to the hash was already stored in the structure, at which point the key is redundant and I don't need it. I think I've used each() 2 times in 10 years of Perl programming and it was probably the wrong choice both times =)
I usually use keys and I can't think of the last time I used or read a use of each.
Don't forget about map, depending on what you're doing in the loop!
map { print "$_ => $hash{$_}\n" } keys %hash;
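map is arguably most at home when the loop builds a new list or hash rather than printing. A minimal sketch, assuming a small sample %hash and a hypothetical %doubled result:

```perl
use strict;
use warnings;

# Hypothetical data, for illustration only.
my %hash = (a => 1, b => 2);

# Build a new hash from the old one in a single expression.
my %doubled = map { $_ => $hash{$_} * 2 } keys %hash;

print join(", ", map { "$_ => $doubled{$_}" } sort keys %doubled), "\n";    # a => 2, b => 4
```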
I would say:
- Use whatever's easiest to read/understand for most people (so keys, usually, I'd argue)
- Use whatever you decide consistently throughout the whole code base.
This gives two major advantages:
- It's easier to spot "common" code so you can refactor it into functions/methods.
- It's easier for future developers to maintain.
I don't think it's more expensive to use keys over each, so no need for two different constructs for the same thing in your code.