pandas dataframe apply lambda index error

up vote
0
down vote

favorite

I have the following code

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1])

where df2 is:

     TaxAccNo2    

0    00001379.1   

1    00182218

When I run the code I get

     TaxAccNo2   TaxAccNo4

0    00001379.1  00001379

1    00182218    00182218

and IndexError: list index out of range for TaxAccNo3,

     TaxAccNo2   TaxAccNo4   TaxAccNo3

0    00001379.1  00001379    1

1    00182218    00182218

How do I fix my code to produce that output? I'm assuming its giving me the error because Index 1 doesn't have '.' but I'm not sure how to fix that.

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

add a comment |

up vote
0
down vote

favorite

I have the following code

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1])

where df2 is:

     TaxAccNo2    

0    00001379.1   

1    00182218

When I run the code I get

     TaxAccNo2   TaxAccNo4

0    00001379.1  00001379

1    00182218    00182218

and IndexError: list index out of range for TaxAccNo3,

     TaxAccNo2   TaxAccNo4   TaxAccNo3

0    00001379.1  00001379    1

1    00182218    00182218

How do I fix my code to produce that output? I'm assuming its giving me the error because Index 1 doesn't have '.' but I'm not sure how to fix that.

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

add a comment |

up vote
0
down vote

favorite

I have the following code

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1])

where df2 is:

     TaxAccNo2    

0    00001379.1   

1    00182218

When I run the code I get

     TaxAccNo2   TaxAccNo4

0    00001379.1  00001379

1    00182218    00182218

and IndexError: list index out of range for TaxAccNo3,

     TaxAccNo2   TaxAccNo4   TaxAccNo3

0    00001379.1  00001379    1

1    00182218    00182218

How do I fix my code to produce that output? I'm assuming its giving me the error because Index 1 doesn't have '.' but I'm not sure how to fix that.

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

I have the following code

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1])

where df2 is:

     TaxAccNo2    

0    00001379.1   

1    00182218

When I run the code I get

     TaxAccNo2   TaxAccNo4

0    00001379.1  00001379

1    00182218    00182218

and IndexError: list index out of range for TaxAccNo3,

     TaxAccNo2   TaxAccNo4   TaxAccNo3

0    00001379.1  00001379    1

1    00182218    00182218

How do I fix my code to produce that output? I'm assuming its giving me the error because Index 1 doesn't have '.' but I'm not sure how to fix that.

python pandas dataframe

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

edited Nov 21 at 1:11

eyllanesc

68.6k93052

edited Nov 21 at 1:11

eyllanesc

68.6k93052

edited Nov 21 at 1:11

eyllanesc

68.6k93052

asked Nov 21 at 1:09

Andy

275

asked Nov 21 at 1:09

Andy

275

asked Nov 21 at 1:09

Andy

275

add a comment |

2 Answers
2

active

oldest

votes

up vote
0
down vote

As you said, the problem is that "00182218".split(".") doesn't have a [1] index, since it's the list ["00182218"].

A simple solution without affecting too much the code is to use a ... if ... else ... ternary operator:

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if '.' in x else '')

Where the last '' is an empty string, the value with which you will fill 'TaxAccNo3' if 'TaxAccNo2' doesn't have a dot (you can replace it if you want other behaviour).

The semantic is: put x.split('.')[1] in df2['TaxAccNo3'] if x contains a dot, otherwise put an empty string.

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

add a comment |

up vote
0
down vote

Hy, I was reviewing your code, the problem is that when you use the method split() in a string the returned object is a list, and this is causing the index error, as you pointed. The solution I encountered if very simple, use a conditional in your code to prevent it from calling this index for shorter lists as follows. Hope it helps.

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if len(x.split('.'))>1 else x)

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53403917%2fpandas-dataframe-apply-lambda-index-error%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

up vote
0
down vote

As you said, the problem is that "00182218".split(".") doesn't have a [1] index, since it's the list ["00182218"].

A simple solution without affecting too much the code is to use a ... if ... else ... ternary operator:

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if '.' in x else '')

Where the last '' is an empty string, the value with which you will fill 'TaxAccNo3' if 'TaxAccNo2' doesn't have a dot (you can replace it if you want other behaviour).

The semantic is: put x.split('.')[1] in df2['TaxAccNo3'] if x contains a dot, otherwise put an empty string.

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

add a comment |

up vote
0
down vote

As you said, the problem is that "00182218".split(".") doesn't have a [1] index, since it's the list ["00182218"].

A simple solution without affecting too much the code is to use a ... if ... else ... ternary operator:

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if '.' in x else '')

Where the last '' is an empty string, the value with which you will fill 'TaxAccNo3' if 'TaxAccNo2' doesn't have a dot (you can replace it if you want other behaviour).

The semantic is: put x.split('.')[1] in df2['TaxAccNo3'] if x contains a dot, otherwise put an empty string.

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

add a comment |

up vote
0
down vote

As you said, the problem is that "00182218".split(".") doesn't have a [1] index, since it's the list ["00182218"].

A simple solution without affecting too much the code is to use a ... if ... else ... ternary operator:

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if '.' in x else '')

Where the last '' is an empty string, the value with which you will fill 'TaxAccNo3' if 'TaxAccNo2' doesn't have a dot (you can replace it if you want other behaviour).

The semantic is: put x.split('.')[1] in df2['TaxAccNo3'] if x contains a dot, otherwise put an empty string.

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

As you said, the problem is that "00182218".split(".") doesn't have a [1] index, since it's the list ["00182218"].

A simple solution without affecting too much the code is to use a ... if ... else ... ternary operator:

df2['TaxAccNo4'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[0])

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if '.' in x else '')

Where the last '' is an empty string, the value with which you will fill 'TaxAccNo3' if 'TaxAccNo2' doesn't have a dot (you can replace it if you want other behaviour).

The semantic is: put x.split('.')[1] in df2['TaxAccNo3'] if x contains a dot, otherwise put an empty string.

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

answered Nov 21 at 1:19

Julian Peller

844411

answered Nov 21 at 1:19

Julian Peller

844411

New contributor

Julian Peller is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

add a comment |

up vote
0
down vote

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if len(x.split('.'))>1 else x)

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

add a comment |

up vote
0
down vote

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if len(x.split('.'))>1 else x)

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

add a comment |

up vote
0
down vote

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if len(x.split('.'))>1 else x)

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

df2['TaxAccNo3'] = df2['TaxAccNo2'].apply(lambda x: x.split('.')[1] if len(x.split('.'))>1 else x)

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

answered Nov 21 at 1:51

Pedro Moresco

113

answered Nov 21 at 1:51

Pedro Moresco

113

New contributor

Pedro Moresco is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

add a comment |

draft saved

draft discarded

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Htykuut